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VASCULAR ENDOTHELIAL GROWTH FACTOR-X 

The present invention is concerned with a novel 
vascular endothelial growth factor (VEGF) herein 
5 designated "VEGF-X", and characterisation of the 
nucleic acid and amino acid sequences of VEGF-X. 

Introduction 

10 Angiogenesis involves formation and proliferation of 
new blood vessels, and is an essential physiological 
process for normal growth and development of tissues 
in, for example, embryonic development, tissue 
regeneration and organ and tissue repair. 

15 Angiogenesis also features in the growth of human 

cancers which require continuous stimulation of blood 
vessel growth. Abnormal angiogenesis is associated 
with other diseases such as rheumatoid arthritis 
psoriasis and diabetic retinopathy, 

20 

Capillary vessels consist of endothelial cells which . 
carry the genetic information necessary to proliferate 
to form capillary networks. Angiogenic molecules 
which can initiate this process have previously been 

25 characterised. A highly selective mitogen for 

vascular enothelial cells is vascular endothelial 
growth factor (VEGF) (Ferrara et al., "Vascular 
Endothelial Growth Factor: Basic Biology and Clinical 
Implications". Regulation of angiogenesis, by I.D. 

30 Goldberg and E.M. Rosen 1997 Birkhauser Verlag 

Basle/Switzerland) . VEGF is a potent vasoactive 
protein which is comprised of a glycosylated cationic 
4 6-4 9 kd dimer having two 24 kd subunits. It is 
inactivated by sulfhydryl reducing agents and is 

35 resistant to acidic pH and to heating and binds to 
immobilised heparin. 
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VEGF-A has four different forms of 121, .165, 189 and 
20 6 amino acids respectively due to alternative 
splicing. VEGF121 and VEGF165 are soluble and are 
capable of promoting angiogenesis, whereas VEGF189 and 
5 • VEGF206 are bound to heparin containing proteoglycans 
in the cell surface. The temporal and spatial 
expression of VEGF has been correlated with 
physiological proliferation of the blood vessels 
(Gajdusek, CM., and Carbon, S.J., Ceil Physiol., 

10 139:570-579, (1989)); McNeil, P.L., Muthukrishnan, L., 
Warder, E., D'Amore, P. A., J. Cell. Biol., 109:811- 
822, (1989) ) . Its high affinity binding sites are 
localized only on endothelial cells in tissue sections 
(Jakeman, L.B., et ai., Clin. Invest. 89:244-253 

15 (1989)), The growth factor can be isolated from 

pituitary cells and several tumor cell lines, and has 
been implicated in some human gliomas (Plate, K.H. 
Nature 359:845-848, (1992)). The inhibition of VEGF 
function by anti-VEGF monoclonal antibodies was shown 

20 to inhibit tumor growth in immune-deficient mice (Kim, 
K.J., Nature 362:841-844, (1993)). 

VEGF proteins have been described in the following 
patents and applications all of which are hereby 

25 incorporated by reference EP-0, 506, 477, WO-95/24473, 
WO-98/28621, WO-90/13649, EP-0 , 47 6, 983, EP-0 , 550, 296, 
WO-90/13649, WO-96/26736, WO-96/27007, WO-98/49300, 
WO-98/36075, WO-98/840124 , WO-90/11084, WO-98/24811, 
WO-98/10071, WO-98/07832, WO-98/02543, WO-97/05250, 

30 WO-91/02058, WO-96/39421, WO-96/39515, WO-98/16551. 

The present inventors have now identified a further 
vascular endothelial growth factor, designated herein 
as W VEGF-X", and the nucleic acid sequence encoding 
35 it/ which has potentially significant benefits for the 
treatment of tumours and other conditions mediated by 
inappropriate angiogenic activity. 
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Summary of the Invention 

In the present application, there is provided a novel 
vascular endothelial growth factor, herein designated 
5 "VEGF-X", nucleic acid molecules encoding said growth 
factor, an expression vector comprising said nucleic 
acid molecule, a host cell transformed with said 
vector and compounds which inhibit or enhance 
angiogenesis. Also provided is the sequence of a CUB 
10 domain present in the sequence of VEGF-X which domain 
itself prevents angiogenesis and which is used to 
treat diseases associated with inappropriate 
vascularisation or angiogenesis. 

15 Detailed Description of the Invention 

Therefore, according to a first aspect of the present 
invention there is provided a nucleic acid molecule 
encoding a VEGF-X protein or a functional equivalent, 

20 fragment, derivative or bioprecursor thereof, said 
protein comprising the amino acid sequence from 
position 23 to 345 of the amino acid sequence 
illustrated in Figure 10. Alternatively, the nucleic 
acid molecule of the invention encodes the complete 

25 sequence identified in Figure 10 and which 

advantageously includes a signal peptide to express 
said protein extracellularly. Preferably, the nucleic 
acid molecule is a DNA and even more preferably a cDNA 
molecule. Preferably, the nucleic acid molecule 

30 comprises the nucleotide sequence from position 257 to 
12 91 of the nucleotide sequence illustrated in Figure 
9. In a preferred embodiment the nucleic acid is of 
mammalian origin and even more preferably of human 
origin, 

35 

In accordance with the present invention a functional 
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equivalent should be taken to mean a protein, or a 
sequence of amino acids that have similar function to 
the VEGF-X protein of the invention. 

5 Also provided by this aspect of the present invention 
is a nucleic acid molecule such as an antisense 
molecule capable of hybridising to the nucleic acid 
molecules according to the invention under high 
stringency conditions, which conditions would be well 
10 known to those skilled in the art. 

Stringency of hybridisation as used herein refers to 
conditions under which polynucleic acids are stable. 
The stability of hybrids is reflected in the melting 
15 temperature (Tm) of the hybrids. Tm can be 
approximated by the formula: 

81 . 5°C+16. 6 (log 10 [Na*] +0 .41 (%G&C) -600/1 

20 wherein .1 is the length of the hybrids in nucleotides. 
Tm decreases approximately by 1-1. 5°C with every 1% 
decrease in sequence homology. 

The term ^stringency" refers to the hybridisation 
25 conditions wherein a single-stranded nucleic acid 

joins with a complementary strand when the purine or 
pyrimidine bases therein pair with their corresponding 
base by hydrogen bonding. High stringency conditions 
favour homologous base pairing whereas low stringency 
30 conditions favour non-homologous base pairing. 

"Low stringency" conditions comprise, for example, a 
temperature of about 37°C or less, a formamide 
concentration of less than about 50%, and a moderate 
35 to low salt (SSC) concentration; or, alternatively, a 
temperature of about 50°C or less, and a moderate to 
high salt (SSPE) concentration, for example 1M NaCl. 
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^High stringency'' conditions comprise, for example, a 
temperature of about 42°C or less, a formamide 
concentration of less than about 20%, and a low salt 
(SSC) concentration; or, alternatively, a temperature 
5 of about 65°C, or less, and a low salt (SSPE) 
concentration. For example, high stringency 
conditions comprise hybridization in 0.5 M NaHP0 4 , 7% 
sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C 
{Ausubel, F.M. et al. Current Protocols in Mo lecular 
10 Biology , Vol, I, 1989; Green Inc. New York, at 
2.10.3). 

"SSC" comprises a hybridization and wash solution. A 
stock 20X SSC solution contains 3M sodium chloride, 
15 0.3M sodium citrate, pH 7.0. 

"SSPE" comprises a hybridization and wash solution. A 
IX SSPE solution contains 180 mM NaCl, 9mM Na 2 HP0 4 and 
1 mM EDTA, pH 7.4. 

20 

The nucleic acid capable of hybridising to nucleic 
acid molecules according to the invention will 
generally be at least 70%, preferably at least 80 or 
90% and more preferably at least 95% homologous to the 
25 nucleotide sequences according to the invention. 

The ahtisense molecule capable of hybridising to the 
nucleic acid according to the invention may be used as 
a probe or as a medicament or may be included in a 
30 pharmaceutical composition with a pharmaceutically 
acceptable carrier, diluent or excipient therefor. 

The term "homologous" describes the relationship 
between different nucleic acid molecules or amino acid 
35 sequences wherein said sequences or molecules are 

related by partial identity or similarity at one or 
more blocks or regions within said molecules or 
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sequences. 

The present invention also comprises within its scope 
proteins or polypeptides encoded by the nucleic acid 
5 molecules according to the invention or a functional 
equivalent, derivative or bioprecursor thereof. 

Therefore, according to a further aspect of the 
present invention, there is provided a VEGF-X protein, 

10 or a functional equivalent, derivative or bioprecursor 
thereof, comprising an amino acid sequence from 
position 23 to 345 of the sequence as illustrated in 
Figure 10, or alternatively which amino acid sequence 
comprises the complete sequence of Figure 10, A 

15 further aspect of the invention comprises a VEGF-X 
protein, or a functional equivalent, derivative or 
bioprecusor thereof, encoded by a nucleic acid 
molecule according to the invention. Preferably, the 
VEGF-X protein encoded by said nucleic acid molecule 

20 comprises the sequence from position 23 to 345 of the 
amino acid sequence as illustrated in Figure 10, or. 
which sequence alternatively comprises the sequence of 
amino acids of Figure 10. 

25 The DNA molecules according to the invention may, 

advantageously, be included in a suitable expression 
vector to express VEGF-X encoded therefrom in a 
suitable host. Incorporation of cloned DNA into a 
suitable expression vector for subsequent 

30 transformation of said cell and subsequent selection 
of the transformed cells is well known to those 
skilled in the art as provided in Sambrook et al. 
(1989), molecular cloning, a laboratory manual, Cold 
Spring Harbour Laboratory Press. 

35 

An expression vector according to the invention 
includes a vector having a nucleic acid according to 
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the invention operably linked to regulatory sequences, 
such as promoter regions, that are capable of 
effecting expression of said DNA fragments. The term 
"operably linked" refers to a juxta position wherein 
5 the components described are in a relationship 

permitting them to function in their intended manner. 
Such vectors may be transformed into a suitable host 
cell to provide for expression of a polypeptide 
according to the invention. Thus, in a further 

10 aspect, the invention provides a. process for preparing 
polypeptides according to the invention which 
comprises cultivating a host cell, transformed or 
transfected with an expression vector as described 
above under conditions to provide for expression by 

15 the vector of a coding sequence encoding the 
polypeptides, and recovering the expressed 
polypeptides. 

The vectors may be, for example, plasmid, virus or 
20 phage vectors provided with an origin of replication, 
and optionally a promoter for the expression of said 
nucleotide and optionally a regulator, of the promoter. 

The vectors may contain one or more selectable 
25 markers, such as, for example, ampicillin resistance . 

Regulatory elements required for expression include 
promoter sequences to bind RNA polymerase and 
transcription initiation sequences for ribosome 

30 binding. For example, a bacterial expression vector 
may include a promoter such as the lac promoter and 
for translation initiation the Shine-Dalgarno sequence 
and the start codon AUG. Similarly, a eukaryotic 
expression vector may include a heterologous or 

35 homologous promoter for RNA polymerase II, a 

downstream polyadenylation signal, the start codon 
AUG, and a termination codon for detachment of the 
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ribosome. Such vectors may be obtained commercially, 
or assembled from the sequences described by methods 
well known in the art. 

5 Nucleic acid molecules according to the invention may 
be inserted into the vectors described in an antisense 
orientation in order to provide for the production of 
antisense RNA. Antisense RNA or other antisense 
nucleic acids may be produced by synthetic means. 

10 

In accordance with the present invention, a defined 
nucleic acid includes not only the identical nucleic 
acid but also any minor base variations including in 
particular, substitutions in cases which -result in a 

15 synonymous codon (a different codon specifying the 

same amino acid residue) due to the degenerate code in 
conservative amino acid substitutions. The term 
^nucleic acid sequence" also includes the 
complementary sequence to any single stranded sequence 

20 given regarding base variations. 

The present invention also advantageously provides 
nucleic acid sequences of at least approximately 10 
contiguous nucleotides of a nucleic acid according to 

25 the invention and preferably from 10 to 50 nucleotides 
even more preferably, the nucleic acid sequence 
comprise the sequences illustrated in Figure 3. These 
sequences may, advantageously be used as probes or 
primers to initiate replication, or the like. Such 

30 nucleic acid sequences may be produced according to 
techniques well known in the art, such as by 
recombinant or synthetic means. They may also be used 
in diagnostic kits or the like for detecting the 
presence of a nucleic acid according to the invention. 

35 These tests generally comprise contacting the probe 
with the sample under hybridising conditions and 
detecting for the presence of any duplex or triplex 
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formation between the probe and any nucleic acid in 
the sample. 

The nucleic acid sequences according to this aspect of 
5 the present invention comprise the sequences of 
nucleotides illustrated in Figures 3 and 5. 

According to the -present invention these probes may be 
anchored to a solid support. Preferably, they are 

10 present on an array so that multiple probes can 
simultaneously hybridize to a single biological 
sample- The probes can be spotted onto the array or 
synthesised in situ on the array. (See Lockhart et 
ai., Nature Biotechnology, vol- 14, December 1996 

15 "Expression monitoring by hybridisation to high 

density oligonucleotide arrays' 7 . A single array can 
contain more than 100, 500 or even 1,000 different 
probes in discrete locations. 

20 The nucleic acid sequences, according to the invention 
may be produced using such recombinant or synthetic 
means, such as for example using PCR cloning 
mechanisms which generally involve making a pair of 
primers, which may be from approximately 10 to 50 

25 nucleotides to a region of the gene which is desired 
to be cloned, bringing the primers into contact with 
mRNA, cDNA, or genomic DNA from a human cell, 
performing a polymerase chain reaction under 
conditions which brings about amplification of the 

30 desired region, isolating the amplified region or 

fragment and recovering the amplified DNA. Generally, 
such techniques are well known in the art, such as 
described in Sambrook et al, (Molecular Cloning: a 
Laboratory Manual, 1989) . 

35 

The nucleic acids or oligonucleotides according to the 
invention may carry a revealing label- Suitable 
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labels include radioisotopes such as 32 P or 35 S, enzyme 
labels or other protein labels such as biotin or 
fluorescent markers. Such labels may be added to the 
nucleic acids or oligonucleotides of the invention and 
5 may be detected using known techniques per se. 

Advantageously, human allelic variants or 
polymorphisms of the DNA molecule according to the 
invention may be identified by, for example, probing 

10 cDNA or genomic libraries from a range of individuals, 
for example, from different populations- Furthermore, 
nucleic acids and probes according to the invention 
may be used to sequence genomic DNA from patients 
using techniques well known in the art, such as the 

15 Sanger Dideoxy chain termination method, which may, 
advantageously, ascertain any predisposition of a 
patient to certain disorders associated with a growth 
factor according to the invention. 

20 The protein according to the invention includes all 
possible amino acid variants encoded by the nucleic 
acid molecule according to the invention including a 
polypeptide encoded by said molecule and having 
conservative amino acid changes. Conservative amino 

25 acid substitution refers to a replacement of one or 
more amino acids in a protein as identified in Table 
1. Proteins or polypeptides according to the invention 
further include -variants of such sequences, including 
naturally occurring allelic variants which are 

30 substantially homologous to said proteins or 

polypeptides. In this context, substantial homology 
is regarded as a^ sequence which has at least 70%, 
preferably 80 or 90% and preferably 95% amino acid 
homology with the proteins or polypeptides encoded by 

35 the nucleic acid molecules according to the invention. 
The protein according to the invention may be 
recombinant, synthetic or naturally occurring, but is 
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The nucleic acid or protein according to the invention 
may be used as a medicament or in the preparation of a 
5 medicament for treating cancer or other diseases or 
conditions associated with expression of VEGF-X 
protein. 

Advantageously, the nucleic acid molecule or the 
10 protein according to the invention may be provided in 
a pharmaceutical composition together with a 
pharmacologically acceptable carrier, diluent or 
excipient therefor, 

15 The present invention is further directed to 

inhibiting VEGF-X in vivo by the use of antisense 
technology. Antisense technology can be used to 
control gene expression through triple-helix formation 
of antisense DNA or RNA, both of which methods are 

20 based on binding of a polynucleotide to DNA or RNA- 
For example, the 5' coding portion or the mature DNA 
sequence, which encodes for the protein of the present 
invention, is used to design an antisense RNA 
oligonucleotide of from 10 to 50 base pairs in length. 

25 A DNA oligonucleotide is designed to be complementary 
to a region of the gene involved in transcription 
(triple-helix - see Lee et al. Nucl. Acids Res., 
6:3073 (1979); Cooney etal., Science, 241:456 (1988); 
and Dervan et al,, Science, 251: 1360 (1991), thereby 

30 preventing transcription and the production of VEGF-X. 
The antisense RNA oligonucleotide hybridises to the 
mRNA in vivo and blocks translation of an mRNA 
molecule into the VEGF-X protein (antisense - Okano, 
J. Neurochem. , 56:560 (1991); Oligodeoxynucleotides as 

35 Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton^FL (1988)). 
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Alternatively, the oligonucleotide described above can 
be delivered to cells by procedures in the art such 
that the anti-sense RNA and DNA may be expressed in 
vivo to inhibit production of VEGF-X in the manner 
5 described above. 

Antisense constructs to VEGF-X, therefore, may inhibit 
the angiogenic activity of VEGF-X and prevent the 
further growth of or even regress solid tumours, since 
10 angiogenesis and neovascularization are essential 
steps in solid tumour growth. These antisense 
constructs may also be used to treat rheumatoid 
arthritis, psoriasis and diabetic retinopathy which 
are all characterized by abnormal angiogenesis. 

15 

A further aspect of the invention provides a host cell 
or organism, transformed or transfected with an 
expression vector according to the invention. The 
host cell or organism may advantageously be used in a 
20 method of producing VEGF-X, which comprises recovering 
any expressed VEGF-X from the host or organism 
transformed or transfected with the expression vector. 

According to a further aspect of the invention there 
25 is also provided a transgenic cell, tissue or organism 
comprising a transgene capable of expressing VEGF-X 
protein according to the invention. The term 
^transgene capable of expression" as used herein means 
a suitable nucleic acid sequence which leads to 
30 expression of VEGF-X or proteins having the same 

function and/or activity. The transgene, may include, 
for example, genomic nucleic acid isolated from human 
cells or synthetic nucleic acid, including DNA 
integrated into the genome or in an extrachromosomal 
35 state. Preferably, the transgene comprises the 

nucleic acid sequence encoding the proteins according 
to the invention as described herein, or a functional 
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fragment of said nucleic acid. A functional fragment 
of said nucleic acid should be taken to mean a 
fragment of the gene comprising said nucleic acid 
coding for the proteins according to the invention or 
5 a functional equivalent, derivative or a non- 
functional derivative such as a dominant negative 
mutant/ or bioprecursor of said proteins. For 
example, it would be readily apparent to persons 
skilled in the art that nucleotide substitutions or 
10 deletions may be used using routine techniques, which 
do not affect the protein sequence encoded by said 
nucleic acid, or which encode a functional protein 
according to the invention. 

15 VEGF-x protein expressed by said transgenic cell, 
tissue or organism or a functional equivalent or 
bioprecursor of said protein also forms part of the 
present invention. 

20 Antibodies to the protein or polypeptide of the 

present invention may, advantageously, be prepared by 
techniques which are known in the art- For example, 
polyclonal antibodies may be prepared by inoculating a 
host animal, such as a mouse or rabbit, with the 

25 polypeptide according to the invention or an epitope 
thereof and recovering immune serum. Monoclonal 
antibodies may be prepared according to known 
techniques such as described by Kohler R. and Milstein 
C. , Nature (1975) 256, 495-497. Advantageously, such 

30 antibodies may be included in a kit for identifying 

VEGF-X in a sample, together with means for contacting 
the antibody with the sample. 

Advantageously, the antibody according to the 
35 invention may also be used as a medicament or in the 
preparation of a medicament for treating tumours or 
other diseases associated with expression of VEGF-X- 
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The invention also further provides a pharmaceutical 
composition comprising said antibody together with a 
pharmaceutically acceptable carrier diluent or 
excipient therefor. 

5 

Proteins which interact with the polypeptide of the 
invention may be identified by investigating protein- 
interactions using the two-hybrid vector system first 
proposed by Chien et al., (1991) Proc. Natl. Acad. 
10 Sci. USA 88 : 9578-9582. 

This technique is based on functional reconstitution 
in vivo of a transcription factor which activates a 
reporter gene. More particularly the technique 

15 comprises providing an appropriate host cell with a 
DNA construct comprising a reporter gene under the 
control of a promoter regulated by a transcription 
factor having a DNA binding domain and an activating 
domain, expressing in the host cell a first hybrid DMA 

20 sequence encoding a first fusion of a fragment or all 
of a nucleic acid sequence according to the invention 
and either said DNA binding domain or said activating 
domain of the transcription factor, expressing in the 
host at least one second hybrid DNA sequence, such as 

25 . a library or the like, encoding putative binding 
proteins to be investigated together with the DNA 
binding or activating domain of the transcription 
factor which is not incorporated in the first fusion; 
detecting any binding of the proteins to be 

30 investigated with a protein according to the invention 
by detecting for the presence of any reporter gene 
product in the host cell; optionally isolating second 
hybrid DNA sequences encoding the binding protein. 

35 An example of such a technique utilises the GAL 4 

protein in yeast. GAL4 is a transcriptional activator 
of galactose metabolism in yeast and has a separate 



WO 00/37641 



PCT/US99/30503 



- 15 - 

domain for binding to activators upstream of the 
galactose metabolising genes as well as a protein 
binding domain. Nucleotide vectors may be 
constructed, one of which comprises the nucleotide 
5 residues encoding the DNA binding domain of GAL4 . 

These binding domain residues may be fused to a known 
protein encoding sequence, such as for example, the 
nucleic acids according to the invention. The other 
vector comprises the residues encoding the protein 

10 binding domain of GAL4. These residues are fused to 
residues encoding a test protein. Any interaction 
between polypeptides encoded by the nucleic acid 
; according to. the invention and the protein to be 
tested leads to transcriptional activation of a 

15 reporter molecule in a GAL-4 transcription deficient 
yeast cell into which the vectors have been 
transformed. Preferably, a reporter molecule such as 
p-galactosidase is activated upon restoration of 
transcription of the yeast galactose metabolism genes. 

20 

A further aspect of the present invention also 
provides a method of identifying VEGF-X in a sample, 
which method comprises contacting said sample with an 
antibody according to the invention and monitoring for 
25 any binding of any proteins to said antibody. A kit 
for identifying the presence of VEGF-X in a sample is 
also provided comprising an antibody according to the 
invention and means for contacting said antibody with 
said sample. 

30 

VEGF-X may be recovered and purified from recombinant 
cell cultures by methods known in the art, including 
ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, 
35 phosphocellulose chromatography, hydrophobic 

interaction chromatography, affinity chromatography, 
hydroxyapatite chromatography and lectin 
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chromatography . 

The VEGF-X protein of the present invention may be a 
naturally purified product, or a product of chemical 
5 synthetic procedures, or produced by recombinant 

techniques from a prokaryotic or eukaryotic host (for 
example, by bacterial yeast, higher plant, insect and 
mammalian cells in culture) . Depending upon the host 
employed in a recombinant production procedure, the 
10 polypeptides of the present invention may be 

glycosylated with mammalian or other eukaryotic 
carbohydrates or may be non-glycosylated. 

VEGF-X is particularly advantageous as a wound healing 
15 agent, where, for example, it is necessary to re- 

vascularize damaged tissues, or where new capillary 
angiogenesis is important. Accordingly, VEGF-X may be 
used for treatment of various types of wounds such as 
for example, dermal ulcers, including pressure sores,. 
20 venous ulcers, and diabetic ulcers. In addition, it 
can be used in the treatment of full-thickness burns 
and injuries where angiogenesis is desired to prepare 
the burn in injured sites for a skin graft and flap. 
In this case, VEGF-X or the nucleic acid encoding it 
25 may be applied directly to the wound. VEGF-X may be 
used in plastic surgery when reconstruction is 
required following a burn, other trauma, or even for 
cosmetic purposes. 

30 An important application of VEGF-X is to induce the 
growth of damaged bone, periodontium or ligament 
tissue. * For example, it may be used in periodontal 
disease where VEGF-X is applied to the roots of the 
diseased teeth, leading to the formation of new bone 

35 and cementum with collagen fibre ingrowths. It can be 
used for regenerating supporting tissues of teeth, 
including alveolar bone, cementum and periodontal 
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ligament, that have been damaged by disease and 
trauma. 

Since angiogenesis is important in keeping wounds 
5 clean and non-inf ected, VEGF-X may be used in 

association witiv surgery and following the repair of 
cuts. It should be particularly useful in the 
treatment of abdominal wounds where there is a high 
risk of infection. 

10 

VEGF-X can also be used for the promotion of 
endothelialization in vascular graft surgery. In the 
case of vascular grafts using either transplanted or 
synthetic material, VEGF-X may be applied to the 

15 surface of the graft or at the junction to promote the 
growth of the vascular endothelial cells. One 
derivation of this is that VEGF-X can be used to 
repair the damage of myocardial and other occasions 
where coronary bypass surgery is needed by stimulating 

20 the growth of the transplanted tissue. Related to 
this is the use of VEGFX to repair the cardiac 
vascular system after ischemia. 

The protein of the present invention may also be 
25 employed in accordance with the present invention by 
expression of such protein in vivo, which is often 
referred to as v gene therapy". 

Thus, for example/ cells such as bone marrow cells may 
30 be engineered with a polynucleotide (DNA or RNA) 

encoding for the protein ex vivo as defined herein/ 
the engineered cells are then provided 'to a patient to 
be treated, with the polypeptide. Such methods are 
well-known in the art- For example, cells may be 
35 engineered by procedures known in the art by use of a 
retroviral particle containing RNA encoding for the 
protein of the present invention. 
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Similarly, cells may be engineered in vivo for 
expression of the protein in vivo, for example, by 
procedures known in the art. 

5 A further aspect of the invention comprises a method 
of treating a disorder mediated by expression of a 
protein according to the invention, by administering 
to a patient an amount of an antisense molecule as 
described herein, in sufficient concentration to 
10 alleviate or reduce the symptoms of said disorder. 

Compounds which inhibit or enhance angiogenesis may be 
identified by providing a host cell or organism 
according to the invention or a transgenic cell, 

15 tissue or organism according to the invention, 

contacting a test compound with said cell, tissue or 
organism and monitoring for the effect of said 
compound compared to a cell tissue or organism which 
has not been contacted with said compound. These 

20 compounds may themselves be used as a medicament or 

included in a pharmaceutical composition for treatment 
of disorders mediated by inappropriate vascularisation 
or angiogenic activity. 

25 The present inventors have also, advantageously, 

identified in the sequence encoding the VEGF-X protein 
a CUB domain, which has heretofore not. previously been 
identified in VEGF-type growth factors. The VEGF-X 
protein may therefore exert dual regulatory effects 

30 via interaction with the VEGF tyrosine kinase 

receptors or with neuropilin receptors mediated by the 
CUB domain. Thus, the sequence encoding said CUB 
domain may be included in an expression vector for 
subsequent transformation of a host cell, tissue or 

35 organism. 

VEGF-X or fragments thereof may be able to modulate 
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the effects of pro-angiogenic growth factors such as 
VEGF as indicated in the findings presented in the 
examples below that the N-terminal part of the VEGF-X 
protein, a CUB-like domain, is able to inhibit VEGF- 
5 stimulated proliferation of HUVECs. VEGF-X or 

fragments thereof may therefore be useful in therapy 
of conditions involving inappropriate angiogenesis. 
Inhibition of the angiogenic activity of VEGF has 
been linked with inhibition of tumour growth in 

10 several models eg Kim K. J. et al, Nature 362:841- 
844, (1993) . Additionally, agents able to inhibit 
angiogenesis would be expected to be useful in 
treating other angiogenesis-dependent diseases such a 
retinopathy, osteoarthritis and psoriasis (Folkman, 

15 J., Nature Medicine 1:27-31, (1995). 

As identified in more detail in the Examples 
described herein the present inventors have 
surprisingly identified that the CUB domain of VEGF-X 

20 is able to inhibit stimulation of proliferation of 

HUVECs induced by either VEGF or bFGF. The CUB domain 
may, therefore, be utilised as a therapuetic agent 
for inhibition of angiogenesis and for treatment of 
condition associated with inappropriate 

25 vascularisation or angiogenesis. 

Therefore according to a further aspect of the 
invention there is provided a method of inhibiting 
angiogenic activity and inappropriate vascularisation 

30 including formation and proliferation of new blood 
vessels, growth and development of tissues, tissue 
regeneration and organ and tissue repair in a subject 
said method comprising administering to said subject 
an amount of a polypeptide having an amino acid 

35 sequence from position 40 to 150 of the sequence 

illustrated in Figure 10 or a nucleic acid molecule 
encoding the CUB domain according to the invention in 
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sufficient concentration to reduce or prevent said 
angiogenic activity. 

Furthermore there is also provided a method of 
5 treating or preventing any of cancer, rheumatoid 

arthritis, psoriasis and diabetic retinopathy , said 
method comprising administering to said subject an 
amount of a polypeptide having an amino acid sequence 
from position 40 to 150 of the sequence illustrated 
10 in Figure 10 or a nucleic acid molecule encoding the 
CUB domain according to the invention in sufficient 
concentration to treat or prevent said disorders. 

The CUB domain may also be used to identify compounds 
15 that inhibit or enhance angiogenic activity such as 

inappropriate vascularisation, in a method comprising 
contacting a cell expressing a VEGF receptor and/or a 
neuropilin 1 or 2 type receptor with said compound in 
the presence of a VEGF-X protein according to the 
20 invention and monitoring for the effect of said 

compound or said cell when compared to a cell which 
has not been contacted with said compound. Such 
compounds may then be used as appropriate to prevent 
or inhibit angiogenic activity to treat the disorders 
25 or conditions described herein, or in a 

pharmaceutical composition. An antibody to said CUB 
domain may also be useful in identifying other 
proteins having said sequences . 



35 
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Deposited Plasmids 

Date of Deposit Accession No. 

Plasmid VEGFX/pCR2.1 
5 1T0P0 FL 1 March 1999 LMBP 3925 



Plasmid VEGFX/pRSETB BD 

amino acids 1 March 1999 LMBP 3926 

10 G230-G345 

Plasmid VEGFX/pcR. 2. 1 

FL Clone 9 20 October 1999 LMBP 3977 

IS Plasmid VEGF-X CUB 

PET22b 20 December 1999 

The above plasmids were deposited at the Belgian 
Coordinated Collections of Microorganisms (BCCM) at 
20 Laboratorium Voor Moleculaire Biologie- 

Plasmidencollectie (LMBP) B-9000, Ghent, Belgium, in 
accordance with the provisions of the Budapest Treaty 
of 28 April 1977. 

25 The invention may be more clearly understood with 
reference to the accompanying example, which is 
purely exemplary, with reference to the accompanying 
drawings, wherein: 

30 Figure 1: is a DNA sequence identified in the 

Incyte LifeSeq™ database coding for a 
novel VEGF-X protein. 



35 



Figure 2: 



is an illustration of amino acid 
sequence of the nucleic acid sequence 
of Figure 1. 
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Figure 3; is an illustration of PCR primer 

sequences utilised to identify the 
VEGF-X protein according to the 
invention. 



10 



Figure 4 : 



is a diagrammatic illustration of the 
spatial relationships in the VEGF-X 
sequence of the clones identified 
using the PCR primer sequences of 
Figure 3. 



15 



Figure 5 : 



Figure 6: 



is an illustration of the nucleotide 
sequences of the 5' RACE primers used 
to identify the 5' end of the VEGF-X 
open reading frame . 

is an illustration of the sequence 
obtained from the RACE experiment • 



20 Figure 1 : is an illustration of the nucleotide 

sequences obtained from the search of 
LifeSeq™ database using the sequence 
in Figure 6. 



25 



Figure 8: 



is an illustration of the primers used 
to clone the entire coding sequence of 
VEGF-X. 



30 



Figure 9: is an illustration of the entire 

coding sequence of vegf-X. 



35 



Figure 10: 



Figure 11: 



is an illustration of the predicted 
amino acid sequence of the nucleotide 
sequence of Figure 9. 

is an alignment of the sequence of 
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Figure 13: 

10 



15 

Figure 14: 

20 

Figure 15: 

25 

Figure 16: 



30 

Figure 17: 
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Figure 10 with the sequences of VEGF-A 
to D. 

is an illustration of variant 
sequences of the VEGF-X protein 
according to the invention, 

is an illustration of the 
oligonucleotide primers used for 
E.coli expression of, VEGF-X domains 
and for expression of the full length 
sequence of VEGF-X in a 
baculovirus/insect cell expression 
system, 

depicts nucleic acid sequences of 18 
human EST clones obtained from a BLAST 
search of the LifeSeq™ database used 
to identify the full sequence encoding 
VEGF-X. 

depicts the nucleotide sequences of 50 
human EST clones obtained from the 
LifeSeq™ database. 

is an illustration of nucleotide 
sequences utilised as primers to 
identify the nucleotide sequence 
encoding VEGF-X. 

is a nucleotide sequence coding for a 
partial VEGF-X protein according to 
the invention. 

is an illustration of a partial 
nucleotide sequence encoding VEGF-X 
protein according to the invention- 
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Figure 20: 
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is an illustration of a DNA and 
polypeptide sequence used for 
mammalian cell expression of VEGF-X. 
The predicted VEGF-X signal sequence 
is in lower case letters. The C- 
terminal V5 epitope and His6 sequences 
are underlined, 

is an illustration of a DNA and 
polypeptide sequence used for 
baculovirus/insect cell expression of 
VEGF-X. In the polypeptide sequence 
the signal sequence is shown in lower 
case* The N-terminal peptide tag 
added to the predicted mature VEGF-X 
sequence is* underlined. 

is an illustration of a DNA and 
polypeptide sequence used for E. coli 
expression of VEGF-X. The polypeptide 
sequences at the N- and C- termini 
derived from the MBP fusion and His 6 
tag respectively are underlined. 

illustrates the disulphide-linked 
dimerisation of VEGF-X. Protein 
samples were analysed by SDS-PAGE. 
Prior to loading the .gel, samples were 
heated to 95 °C for 5 minutes in sample 
buffer in the presence (+) or absence 
(-) of reducing agent. (A) samples 
from COS cell expression of a C- 
terminally V5/His6 peptide-tagged 
construct. The left hand panel is 
total conditioned medium, the right 
hand panel is material purified on 
Nickel agarose resin. Reduced monomer 
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15 

Figure 23: 

20 



25 



Figure 24: 

30 



35 

Figure 25: 



- 25 r 

and putative disulphide-linked, non- 
reduced dimer are indicated by arrows. 
There appears to be proteolysis of the 
protein during purification. Gels were 
blotted onto nylon membranes and 
protein detected with an anti V5 
monoclonal antibody. (B) Samples from 
E.coli expression of a maltose-binding 
protein/His6 dual fusion construct, M 
indicates the molecular weight markers 
(Benchmark, Lif eTechnologies) . The 
gel was stained with Coomassie Blue by 
standard procedures. The fusion 
protein has an apparent molecular 
weight of 80kDa. 

illustrates the glycosylation of VEGF- 
X. VEGF-X was purified from the 
culture supernatant of COS cells 
transfected with the pcDNA6/V5-His 
construct. Supernatants were 
harvested 72h post-transf ection and 
purified on nickel resin. Samples 
were then treated with EndoH ( + ) or 
untreated (-) before SDS-PAGE and 
blotting, as described in the legend 
to Figure 22. 

is an illustration of the DNA and 
polypeptide sequence used for E. coli 
expression of the VEGF-like domain of 
VEGF-X. Polypeptide sequences at the 
N-terminus of the protein derived from 
the vector are underlined. 

shows expression of the VEGF-X VEGF 
domain in E. coli. Lane 1-10^1 broad 
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Figure 27: 

20 

Figure 28: 

25 
30 

Figure 29: 

35 
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range marker (New England Biolabs), 
lane 2-lQyil unreduced sample, lane 3- 
lOyil reduced sample. The reduced PDGF 
domain protein (lane 3) has an 
apparent molecular weight of 
approximately 19kDa on SDS-PAGE. 

illustrates a DNA and polypeptide 
sequence used for E. coii expression 
of the CUB-like domain of VEGF-X. The 
polypeptide sequence at the N-terminus 
derived from the vector-encoded signal 
and the introduced His6 tag are 
underlined. 

shows expression of the VEGF-X CUB 
domain in E. coli. The CUB domain 
protein was purified on Nickel chelate 
resin. The protein migrates at 
approximately 23kDa on SDS-PAGE. 

illustrates the effect ox truncated 
VEGF-X (CUB domain) on HUVEC 
proliferation. (A) Human Umbilical 
Vein Endothelial Cells (one-day- 
treatment) . (B). Human Umbilical Vein 
Endothelial Cells (24-hour starving 
followed by one-day-treatment) . (C) 
Effect of VEGF-A l€3 and VEGF-X CUB 
domain on the proliferation of HUVEC 
(two-day-treatment) . 

depicts the tissue distribution of 
VEGF-X mRNA analysed by Northern 
blotting and RT-PCR in (A) normal 
tissues and (B) tumour tissue and cell 
lines. 
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10 



15 



Figure 30: depicts the partial intron/exon 

structure of the VEGF-X gene. (A) 
Genomic DNA sequences of 2 exons 
determined by sequencing; exon 
sequence is in upper case, intron 
sequence is in lower case. (B) Shows 
the location of splice sites within 
the VEGF-X cDNA sequence. The 
location of mRNA splicing events is 
indicated by vertical lines. The 
cryptic splice donor/acceptor site at 
nt. 998/999 (diagonal lines) gives 
rise to the splice variant forms of 
VEGF-X- No splice site information is 
given for the region shown in italics. 



20 



Figure 31: is a graphic representation of the 

effect of FL- VEGF-X on HuVEC 
proliferation: (24 hour serum 
starvation followed by one day 
treatment) . 



25 



Figure 32: is a graphic representation of the 

combined effect of truncated VEGF-X 
(CUB domain) and human recombinant 
VEGF 1W on HuVEC proliferation: (24 hour 
serum starvation followed by two day 
treatment) . 



30 



35 



Figure 33: is a graphic representation of the 

combined effect of the CUB domain and 
human recombinant bFGF on HuVEC 
proliferation: (24 hour serum 
starvation followed by two day 
treatment) - 



Figure 34: 



is a graphic representation of the 
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results of a LDH assay for testing 
cytotoxicity of the CUB domain or the 
CUB domain with rhVEGF U5 . 

Figure 35: is a graphic representation of the 

results obtained from a LDH assay for 
testing cytotoxicity of the CUB domain 
or CUB domain with rh-bFGF. 

A BLAST (Basic Local Alignment Search Tool; Altschul 
et al./ 1990 J. Mol . Biol. 215, 403-410) search was 
performed in the proprietary LifeSeq™ human EST 
database (Incyte Pharmaceuticals/ Inc., Palo Alto, 
CA, USA) - BLAST produces alignments of both 
nucleotide and amino acid sequences to determine 
sequence similarity. Because of the local nature of 
the alignments, BLAST is especially useful in 
determining exact matches or in identifying 
homologues. While it is useful for matches which do 
not contain gaps, it is inappropriate for performing 
motif-style searching. The. fundamental unit of BLAST 
algorithm output is the High-scoring Segment Pair 
(HSP) . 

Eighteen human EST clones (Figure 14) with high 
similarity to the previously identified VEGF proteins 
were identified and a further fifty EST clones 
(Figure 15) were identified using these sequences as 
query sequences, allowing us to deduce the putative 
sequence for the new VEGF-X protein. The sequences 
obtained were compared to known sequences to 
determine regions of homology and to identify the 
sequence as a novel VEGF-type protein. Using the DNA 
sequence information in the databases we were able to 
prepare suitable primers having the sequences of 
VEGF-X 1-10 illustrated in Figure 3 for use in 
subsequent RACE experiments to obtain the complete 
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DNA sequence for the VEGF-X gene. 
Cloning 

5 A profile was developed based on the VEGF-like domain, 
in existing VEGF sequences (VEGF-A, B, C and D) . 
This was used to search the public databases and the 
Incyte LifeSeq™ database. No significant novel 
matching sequences were found in the public 
10 databases. All of the matching .sequences found in 

the LifeSeq™ database (-1000) were assembled to give 
a smaller number of sequences (-30) , which included 
the known VEGFs and a potential novel VEGF (figures 
1 and 2) . This sequence was named VEGF-X. 

15 

Oligonucleotides were designed to amplify the VEGF-X 
sequence from cDNA (figure 3). The ESTs found in 
LifeSeq™ were from a range of tissues, with a slight 
predominance of sequences from ovary, testis, 
20 placenta and lung (Figure 14 and 15) . Accordingly 
the oligonucleotides were used to amplify cDNA 
derived from lung and placenta. First-round PCR 
products were found at -200bp larger than the 
expected sizes, while 3 major species appeared after 
25 a second round of PCR amplification, the smallest of 
which was of the expected size. These fragments were 
cloned and sequenced. The smallest fragment did 
indeed have the sequence originally identified from 
the LifeSeq database, while the others contained 
30 insertions (figure 4) ♦ 

As. the first round of amplification suggested that 
the major species found in cDNA from ovary and 
placenta was not that originally identified in the 
35 LifeSeq™ database, the focus of effort was switched 
to the presumed major species (it seemed likely that 
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clones 57, 2S-27 and 2.1kb clones 1-3 in fig 4 
represented the major mRNA species) . Conceptual . 
translation of the DNA sequences of these cloned PGR 
fragments indicated that the complete open reading 
5 frame was not present in the clones or in the 

sequence from LifeSeq™. While all clones contained 
the same sequence in the region of the translation 
termination codoh, indicating that the end of the 
open reading frame had been identified, the 5' end of 
10 the open reading frame had not been cloned. 5' RACE 
experiments were therefore carried out in order to 
find the start of the reading frame. PGR primers 
designed for RACE experiments are shown in figure 5. 
RACE PCR products were sequenced directly. Sequence 
15 could be obtained from the 3' end of these RACE 

products but not from the 5' end; probably because 
the products were not cloned and were therefore 
heterogeneous at the 5' end. This new sequence was 
assembled with the existing cloned sequence to give 
20 the sequence shown in figure 6. Searching the 

LifeSeq™ database with this sequence identifies ESTs 
which extend the sequence a further 140bp in the 5' 
direction and a further 160bp in the 3' direction 
(figure 7) . This longer contig was used to design 
25 oligonucleotide primers to amplify the entire coding 
sequence (these primer sequences are shown in figure 
8). PCR was carried out using primers 5'-l and 
vegfXIO (in order to clone a "full-length'' cDNA) , and 
with primers 5'-l and vegfX6 (in order to clone the 
30 full coding region, see figure 3 for sequences of 
vegfXIO and vegfX6) . A number of clones were 
obtained for the shorter fragment, of which clones A 
and 7 contain no PCR errors (sequence of clones 4 & 7 
in figure 9) . A single clone was obtained for the 
35 longer fragment (clone 9) , but this sequence appears 
to contain 2 PCR errors. 



The predicted polypeptide from these longer contigs 
is shown in figuxe 10. Amino acids 1-22 are 
predicted to encode a signal sequence (von Heijne, 
1986, Nucleic Acids Res. 14, 4683-4690) . Figure 11 
shows an alignment of the protein sequence with VEGFs 
A-D. The region homologous to the other VEGFs is 
located towards the C-terminus of the protein. As 
the VEGF homology domain is expected to belong to the 
TGF-beta superfamily of growth factors and to consist 
of a dimer containing both intra- and intermolecular 
disulphide bonds, initial alignments focussed on the 
cysteines. However/ mapping of the sequence onto the 
known x-ray structure of the VEGF-A receptor-binding 
domain (Muller et al (1997) Proc. Natl. Acad. Sci USA 
94, 7192-7197) suggests that the alignment in fignxe 
11 is plausible, as the extra 4 cysteine residues 
within the VEGF-homology region of VEGF-X (compared 
to this region of VEGF-A) correspond to residues 
which are spatially close in VEGF-A, and may 
therefore be able to form disulphide bonds. 

A search of the PFAM database of protein domains with 
the full-length polypeptide sequence from figure 10 
identifies two domain consensus sequences within the 
polypeptide. The more C-terminal domain is a "VEGF" 
domain: (the known VEGFs all contain this domain and 
the structure of this region of VEGF-A is similar to 
that of PDGF) . Additionally towards the N-terminus 
of the polypeptide there is a CUB domain (amino acids 
-40-150) . The CUB domain is a 100-110 amino acid 
extracellular domain found in a number of 
developmentally- regulated proteins. When the full- 
length protein is used to search the protein 
databases using the BLAST 2 algorithm, the scores for 
matches to CUB domain-containing proteins are more 
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significant than those to the other VEGFs. 
Interestingly, the most significant matches are to 
the CUB domains of Neuropilins, and Neuropilin-1 was 
recently identified as a receptor of one of the VEGF- 
5 A isoforms VEGF-A 165 (Soker et al. (1998) Cell 92, 735- 
745). 

Assuming that the variant sequences isolated by PCR 
(i.e. the smaller PCR fragments) use the same 

10 translation initiation site as the full-length 

sequence, they would result in production of the 
variant proteins shown in figure 12. It may be 
significant that both of these variant proteins 
retain the CUB domain and delete all or part of the 

15 VEGF-like domain. The production of these variant 
sequences can be explained by the use of a cryptic 
splice donor/acceptor site within the VEGF-X sequence 
(figure 30B, between nt. 998/999): one variant arises 
fey splicing out of the region between nt. 72 9-998, 

20 the other by splicing out of the region between nt. 
999-1187. 

Expression 

25 Full-length expression constructs 

Mammalian cells 

Clone 4 containing the full CDS of VEGF-X (see figure 
9) , was used to generate constructs for expression of 
full-length protein. The sequence was amplified by 

30 PCR and cloned into the vector pCDNA6/V5-His so as to 
add a C-terminal V5 epitope tag and His 6 tag. The DNA 
and polypeptide sequence in this vector is shown in 
figure 19. Transient expression in COS cells 
followed by western blotting and detection via an . 

35 anti-V5 mAb demonstrates the secretion of a protein 
of ~50K into the medium in transfected cells only 
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(figure 22A) . This construct can also be used to 
generate VEGF-X expressing stable CHO cell lines. 

Baculovi rus/Insect-cell expression system 
5 For expression in the baculovirus/insect cell system 
the DNA encoding the predicted mature VEGF-X 
polypeptide sequence was fused to a sequence encoding 
a signal derived from melittin, a secreted insect 
protein. An N-terminal 6His tag was also added to 

10 facilitate purification. The insert was then cloned 
into the baculovirus expression vector pFASTBAC. The 
DNA and polypeptide sequence of this construct is 
shown in figure 20. Infection of Trichcplusia ni Hi5 
cells with this recombinant baculovirus results in 

15 the secretion of a protein of approximately 4 5K into 
the medium (data not shown) . 



20 



E. coli 

The coding region of VEGF-X has been cloned in a 
variety of ways for expression as a secreted protein 
in E.coll. A particularly useful expression clone 
carries an N-terminal fusion to the E.coli 
maltose-binding protein (MBP- derived from the 
expression vector pMAL-p2, New England Biolabs) and a 
25 C-terminal fusion to a 6His tag. The DNA and 

polypeptide sequence of this vector is shown in 
figure 21. Sequential purification of cell fractions 
on Ni-NTA resin and amylose resin allows the 
isolation of the expressed protein (see figure 22B) . 

30 

expression of fragments 

VEGF 

The VEGF domain of VEGF-X has been expressed in 
E.coli. Similar domains from VEGF-A (Christinger et 
35 al. (1996) PROTEINS: Structure, Function and Genetics 
26, 353-357), and VEGF-D (Achen et al (1998) Proc. 
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Natl. Acad. Scl USA 95, 548-553) have been shown to 
be capable of binding to the respective receptors. 
Expression of these domains was carried out using the 
bacterium E.coll. Additionally, the full-length 
5 protein-. was expressed using the baculovirus/insect 

cell expression system. The oligonucleotide primers 
which have been obtained for these experiments are 
shown in figure. 13. The construct directed 
expression in the bacterial cytoplasm, and as 

10 expected the protein was produced in insoluble form 

in inclusion bodies (the DNA and polypeptide sequence 
used for PDGF domain expression is shown in figure 
24) • Inclusion bodies were washed, solubilized with 
urea and the protein purified under denaturing 

15 conditions, before refolding by dialysis to remove 
the urea. Soluble protein was obtained, but shows 
little evidence of the disulphide bond linked dimers 
seen with material derived from animal cells (figure 
25, compare with figure 22A & B) . It is not clear 

20 therefore whether this protein is correctly folded. 

CUB 

The CUB domain has been expressed as a soluble 
secreted protein in E,coli (figure 26) . The protein 
25 was purified by binding to Ni-NTA resin (figure 27) 
and assayed for activity on KUVECs in an in-vitro 
proliferation assay. 

Properties of the VEGF-X protein 
30 The transient mammalian cell expression system 

described above has been used to generate full-length 
VEGF-X protein, as shown by antibody detection 
following Western blotting (see figure 22A) . 



35 



Disulphide bond linked dimers 

The other members of the PDGF family of growth 
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factors, the PDGFs and VEGFs, all exist as dimers in 
which two monomers constituting the dimer are linked 
by interchain disulphide bonds. The x-ray structures 
of PDGF-BB (Oefner et al, 1992), and VEGF-A (Muller 
5 et al, 1997} are known and indicate that at least 
these two members of the family contain two 
interchain disulphide bonds. Practi cally this means 
that in SDS-.PAGE analysis of these growth factors the 
presence of interchain disulphide bonds is shown by a 

10 large decrease in mobility in the absence of reducing 
agent (ie. the nonreduced dimer migrates more slowly 
through the gel than the reduced monomer) • This 
effect was also expected for VEGF-X, and has been 
demonstrated for the material obtained from transient 

15 - mammalian cell expression (figure 22A) - In the case 
of the full length material produced in E.coli only 
some 10% of the total VEGF-X protein appears to be 
present as disulphide bond-linked dimers (figure 
22B) . However, these results provide evidence that 

20 the mammalian cell-derived protein is correctly 
folded, and that a portion of the E. coii-derived 
protein is too. 

Glvcosylation 

25 There are 3 predicted potential N-linked 

glvcosylation sites within the VEGF-X protein: at 
residues 25, 55 and 254 of the polypeptide sequence. 
The predicted molecular mass of the mature VEGF-X 
protein is 40kDa, but SDS-PAGE and western blotting 

30 (detection via an introduced C-terminal epitope tag- 
see figure 19) of the full-length protein expressed 
in COS cells gives a band slightly larger than the 
expected size (45-50kDa) as well as one at 25kDa 
(figure 22A) - This smaller band is presumed to be a 

35 C-terminal proteolysis fragment derived from the 

full-length molecule (controls from uninfected cells 
do not show this band) , probably corresponding to a 
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cleavage between the CUB and VEGF domains. EndoH 
treatment of the preparation gives a slight mobility 
change for the full-length protein (figure 23), but 
for the smaller VEGF domain fragment there is a clear 
5 change, indicating that the predicted glycosylation . 
site within the VEGF domain at residue 254 is indeed 
glycosylated. 

Activity o£ proteins in cell-based assays 
10 Protein samples were tested for activity in cell 
proliferation/ cell migration and in-vitro 
angiogenesis assays. Active samples can also be 
tested in the in vivo matrigel mouse model of 
angiogenesis. 

15 

Full-length VEGF-X protein 

Conditioned medium derived from COS cells transiently 
expressing VEGF-X (see figure 22A) displayed no 
detectable activity in any of the assays. However, 

20 as VEGF-X protein could only be detected in this 
preparation by Western blotting/ and not by 
Coomassie-staining of gels, it is clearly present at 
very low levels and this may be the reason for the 
observed lack of activity in the cell proliferation, 

25 migration or in vitro angiogenesis tests • 

VEGF domain 

The VEGF domain protein described above has been 
tested in cell proliferation (on a range, of cell 
30 types), cell migration and in vitro angiogenesis 
assays and has failed to show activity in any of 
these tests. As suggested above, this may be due to 
incorrect folding of this protein. 



35 



CUB domain 

The CUB domain protein at the highest dose tested 
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(lpg/ml) appears to inhibit proliferation of HUVECs 
in the absence of other stimulation (figure 28A & B) . 
This effect is also seen following stimulation with 
the lowest VEGF-A l6S dose tested (Ing/ml- figure 28C) . 
5 The CUB domain of VEGF-X therefore appears to show 
antiproliferative activity on HUVECs, even in the 
presence of low VEGF-A L6 5 closes. 

Tissue distribution of mRNA 

10 VEGF-A mRNA expression has been shown to be 

upregulated in a wide variety of human tumors (lung, 
breast, ovarian, colon, stomach, liver, pancreas, 
kidney, bladder and prostate- Takahashi et al, 1995) . 
Tumor VEGF-A expression has been shown to correlate 

15 with tumor growth rate, microvascular density and 
tumor metastasis (Takahashi et al, 1995) . It was 
thus of interest to examine the mRNA expression 
patterns of VEGF-X. Accordingly, Northern blot 
analysis of mRNA derived from different tissues has 

20 been carried out. The results indicate that although 
the VEGF-X mRNA is expressed at low levels, it is 
present in a wide range of tissues. PCR 
amplification of cDNA from a range of tissue sources 
supports this idea (figure 29A) . The major mRNA 

25 species is approximately 3.1kb in size. There is 

no significant upregulation seen in tumour cell lines 
or in tumour tissues tested (figure 29B), with the 
possible exception of the cell lines GI-117 (lung 
carcinoma) and SaOS-2 (osteosarcoma) . The results of 

30 these initial tissue distribution studies do not, 
therefore, provide evidence for upregulation of 
VEGF-X in tumour growth, as is seen with VEGF-A, 

Genomic structure of the VEGF-X gene 
35 A genomic BAG clone covering the 3' part of the 

VEGF-X locus was isolated by hybridisation screening 
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of nylon filters containing a human BAC library. 
Direct sequencing of this clone using oligonucleotide 
primers based on the VEGF-X cDNA sequence allowed the 
determination of several intron/exon boundaries 
5 {figure 30). Interestingly, the position of the mRNA 

splice site within the PDGF domain (nt 1187/1188 in 
figure 30B) is conserved with respect to those in the 
VEGF-A and VEGF-D genes (Tischer et al, 1991; 
Rocchigiani et al, 1998). 

10 

Materials & Methods 

FCR, Cloning, DNA sequence determination and BAC 
screening . 

15 All primers were purchased from Eurogentec, Seraing, 
Belgium. Insert-specific sequencing primers (15- and 16- 
mers) were designed by visual inspection of the DNA 
sequences. DNA was prepared on Qiagen-tip-20 columns or 
on Qiaquick spin columns (Qiagen GmbH, Dttsseldorf, 

20 Germany) and recovered from the spin columns in 30ul 
Tris/EDTA-buffer (lOmM TrisHCl pH 7.5, 1 mM EDTA (sodium 
salt) ) . Sequencing reactions were performed using 
BigDye™ Terminator Cycle Sequencing Ready Reaction kits 
(Perkin Elmer, ABI Division, Foster City, CA, USA) and 

25 were run on an Applied Biosystems 377 DNA sequencer 
(Perkin Elmer, ABI Division, Foster City, CA, USA) . 
Polymerase chain reactions were carried out according 
to standard procedures (Ausubel et al, 1997) . The 
PCR fragments were cloned into vectors pCR2 . 1 

30 (Invitrogen, Carlsbad, CA. USA) or pCR-TOPO 

(Invitrogen,NL) according to the manufacturer's 
instructions. One of those vectors, plasmid 
VEGFX/pCR2.1 1TOPO FL 

was deposited on 1 March 1999 under Accession No. 
35 LMBP 3925. After sequence determination, the inserts 
were cloned into the desired expression vectors (see 
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figures 19, 20, 21, 24 & 26) . 

A human genomic BAC library (Genome Systems, Inc., St 
Louis, Ml, USA) was screened by hybridisation to 
5 oligonucleotides derived from the VEGF-X cDNA 
sequence, according to the manufacturer's 
instructions, BAC DNA was prepared using a Qiagen 
plasmid midi kit (Qiagen GmbH, Dttsseldorf , Germany ) 
according to the manufacturer's instructions with 

10 some modifications (after clearing of the lysate from 
chromosomal DNA, supernatant s from individual 
preparations were pooled on a single column (tip 
100), and after the 70 % EtOH wash, the pellet was 
resuspended overnight at 4°C in 100 pi TE) . 20-rner 

IS sequencing primers were designed based on the known 
cDNA sequence, and sequencing carried out as above. 

5' RACE 

20 In order to extend the cDNA clone in a 5' direction 
RACE reactions were carried out. Since it was known 
that the mRNA is present in placenta and skeletal 
muscle, Marathon-Ready™ placenta and skeletal muscle 
cDNAs were purchased from Clontech (Palo Alto CA. 

25 -USA) and used according to the manufacturer's 
instructions. DNA fragments were excised from 
agarose gels, purified using QiaQuick PCR 
purification columns (Qiagen GmbH, Diisseldorf , 
Germany) and sequenced directly. 

30 

VEGF-X protein expression and puxification 
DNA fragments encoding the desired protein sequences 
were amplified by PCR and cloned into appropriate 
expression vector systems. 

35 

For mammalian cell expression, the full coding 
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sequence was cloned into the vector pcDNA6/V5-his 
(Invitrogen Leek/ NL, see figure 19 for construct 
sequence) , so as to add a C-terminal peptide tag to 
assist in detection and purification. 

5 

For insect cell expression the sequence of the 
predicted mature polypeptide was initially amplified 
to add an N-terminal 6His peptide and then cloned 
into the pMelBacB vector (Invitrogen/ Leek, NL) to 
10 add an insect cell signal sequence. The entire 

insert was then PCR-cloned into the vector pFASTBAC-1 
(LifeTechnologieS/ Gaithersburg, MA, USA) for 
construction of a baculovirus according to the 
manufacturer's instructions. 

15 

For E.coli expression^ the coding region was PCR 
amplified to add a C-terminal 6His tag and then 
cloned into the vector pMAL-p2 (New England BiolabS/ 
Beverly, MA, USA) . The coding sequence of this 
20 construct is shown in figure 21) , The protein was 
purified first on Ni-NTA resin (Qiagen GmbH, 
Diisseldorf, Germany) and then on amylose resin (New 
England Biolabs, Beverly/ MA, USA), according to the 
manufacturers' instructions. 

25 

DNA sequences encoding the CUB and VEGF domain 
fragments of VEGF-X were PCR amplified and cloned 
into pET22b and pET21a (Novagen, Madison, WI, USA) 
respectively. The CUB domain protein was prepared 

30 either from the periplasm or medium of induced 

cultures by standard methods (Ausubel et al, 199*7) . 
The protein was initially purified by precipitation 
with 20% ammonium sulphate. After overnight dialysis 
vs 20mM Tris Hcl pH7.5, lOOmM NaCl to remove ammonium 

35 sulphate, the protein was further purified on Ni-NTA 
resin as described above. The VEGF domain protein 
was expressed in insoluble form, and preparation of 
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inclusion bodies was carried out using standard 
procedures (Ausubel et al 1997) . Inclusion bodies 
were dissolved in 6M guanidine hydrochloride, 20mM 
Tris Hcl pH8.0, 200mM NaCl, IniM 2-mercaptoethanol, 
5 and purified on Ni-NTA resin (Qiagen GmbH, 

Diisseldorf, Germany) according to the manufacturer's 
instructions. The protein was refolded by dialysis 
against several .changes of buffer containing 
decreasing concentrations of denaturant. 

10 

Analysis of protein glycosylation was carried out 
using EndbH (Roche Molecular Biochemicals, Brussels, 
BE) according to the manufacturer's instructions. 

15 Cell Proliferation Assay 

Human umbilical vein endothelial cells (HUVECs) 
(Clonetics, San Diego, CA.) were trypsinized with 
0-05% trypsin/0. 53mM EDTA (Gibco, Gaithersburg, MD. ) , 
resuspended in the EGM-2 (Clonetics, San Diego, CA. ) , 

20 counted, and distributed in a 96-well tissue culture 
plate at 5,000 cells/well. Following cell attachment 
and monolayer formation (16 hours), cells were 
stimulated with various concentrations of truncated 
VEGF-X (CUB domain or VEGF domain) or dilutions of 

25 culture supernatants of the full-length VEGF-X (COS 7 
or HEK293) in DMEM (Gibco, Gaithersburg, MD.) 
containing 0.5% to 2% FBS (HyClone, Logan, UT) as 
indicated- For human fetal dermal fibroblasts 
(American Type Culture Collection, Rockville, MD. ) , 

JO the growth medium was replaced by DMEM containing 
0.1$ BSA (Sigma, St. Louise, MO.) with or without 
various concentrations of truncated VEGF-X proteins. 
For KCASMC (Clonetics, San Diego, CA.), the medium 
was replaced by DMEM containing 0.5% FBS. The cells 

>5 were treated for a further 24 hr-72 hr. For the 

measurement of proliferation, the culture media were 
replaced with 100 ul of DMEM containing 5% FBS and 3 



uCi/ml of [3H] -thymidine (Amersham, Arlington 
Heights, IL.). Following pulse labeling, cells were 
fixed with methanol /acetic acid (3:1, vol/vol) for 1 
hour at room temperature. The cells were washed 
twice with 250 yil/well 0 f 80% methanol. The cells 
were solubilized in 0.05% trypsin (lOOul/well) for 
30 minutes then in 0,5% SDS (100 ul/well) for another 
30 minutes. Aliquots of cell lysates (190 ul) were 
combined with 2 ml of scintillation cocktail (Fisher, 
Springfiled, NJ) and the radioactivity of cell 
lysates was measured using a liquid scintillation 
counter (Wallac 1409). In each case, samples were 
performed in quadruplicate. 

Chemotaxis Assay 

The chemotactic response of HUVECs was assayed using 
a 48-well modified Boyden chamber (NeuroProbe, Cabin 
John, MD.) and collagen-coated (0,lmg/ml type I 
collagen, Collaboratic Biomedical, Bedford, MA.) 
polycarbonate membrane filters with a pore diameter 
of 8 urn (NeuroProbe, Cabin John, MD.). Cell 
suspensions (15, 000/well) were loaded to the upper 
part of the chemotaxis chamber and stimulated for 4 
hours with rhVEGF 165 (0.1-10 ng/ml) (Calbiochem, San 
Diego, CA. ) or various concentrations of truncated 
VEGF-X (PDGF domain) . Cells remaining on the top of 
the membrane were removed. Migration was assessed by 
counting the number of cells that migrated to the 
lower side of the filter membrane. The membrane was 
fixed with 10% formaldehyde for 15 min, followed by 
staining with Gill's hemotoxylin III- (Poly 
Scientific, Bay Shore, NY. ) . The assay was performed 
in triplicates and six independent high power fields 
per well were counted using a light microscope at 250 
magnification. The results were expressed as the fold 
of unstimulated cells (EGM containing 0.1% BSA) . 
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Xji Vxtizo Angiogenesis Assay 

In vitro angiogenesis in fibrin gels was quantitated 
using spheroids of human umbilical vein endothelial 
cells (Korff et al., 1998). To generate endothelial 
cell spheroids of defined size and cell number/ a 
specific number of cells 800 cells per spheroid) 
was suspended in EGM-2 culture medium containing 20% 
methylcellulpse .(Sigma, St. "Louis, MO.), seeded into 
nonadherent round-bottom 96-well plates. All 
suspended cells in one well contributed to the 
formation of a single endothelial cell spheroid 
within 24 hours. A fibrin gel stock solution was 
prepared freshly prior to use by mixing. 3mg /ml 
fibrinogen (Calbiochem, San Diego, CA. ) in Medium 
199(Gibco, Gaithersburg, MD.). Assays were performed 
in 24-well culture plates. The 1ml fibrinogen stock 
was mixed with 50 HUVEC spheroids and the 
corresponding test substance including rh-VEGF X65 or 
various concentration of VEGF-X. The 
spheriod-containing fibrinogen was rapidly 
transferred into 24-well plates. Fifteen microliters 
of thrombin (100 NIH U/ml stock, Sigma, St. Louis, 
MO.) was added to the gel for the fibrin gel 
formation. The gel formation usually occurred within 
30 seconds. After gel formation, lml /well of Medium 
199 supplemented with 20% FBS, Img/ml e-aminocaproic 
acid (Calbiochem, San Diego, CA. ) and antibiotics 
were added. The gel was incubated at 37 °C (5%C0 3 , 95% 
air, 100% humidity) . After 3 days, in vitro 
angiogenesis was quantitated by measuring the length 
of the three longest capillary sprouts that had grown 
out of each spheroid (100X magnification) , analyzing 
at least 10 spheroids per experimental group and 
experiment . 



Matrigel Mouse Assay 
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The matrigel mouse assay is carried out as described 
by Passanti et al (1992). 

Analysis of VEGF-X gene expression by RT-PCR 
5 analysis . 

Oligonucleotide primers VEGF-E2 and VEGF-X14 (figure 
16; figure 5) were used for the specific PCR 
amplification of a 350 bp fragment from VEGF-X, PCR 
amplifications were performed on human multiple 
10 tissue cDNA (MTC™) panels (Clontech human MTC panels 
I and II and human Tumor MTC panel) normalised to the 
mRNA expression levels of six different housekeeping 
genes, in addition, cDNA was made from different 
tumor cell cultures (Caco-2 colorectal 
15 adenocarcinoma; T-84 colorectal carcinoma; MCF-7 
breast adenocarcinoma; T-47D breast ductal gland 
carcinoma; HT1080 bone fibrosarcoma; SaOS-2 
osteosarcoma; SK-N-MC neuroblastoma; HepG2 
hepatoblastoma; JURKAT T-cell leukemia and THP-1 
20 myelomonocytic leukemia) . For the preparation of 

tumor cell cDNA, cells were homogenised and total RNA 
prepared using the RNeasy Mini kit (Qiagen GmbH, 
Hilden, Germany) according to manufacturer's 
instructions. 1 pg of total RNA was reverse 
25 transcribed using oligo(dT)15 as a primer and 50 U of 
Expand™ Reverse Transcriptase (Boehringer Mannheim, 
Mannheim, Germany) according to the manufacturer's 
instructions. FCR reactions with VEGF-X-specif ic osr 
glyceraldehyde-3-phosphate dehydrogenase 
30 (G3PDH) -specific primers were then performed on 1 pi 
of this cDNA. For all cDNAs, PCR reactions with 
VEGF-X specific primers were performed in a total 
volume of 50 pi, containing 5 pi (±1 ng) of cDNA, lx 
Advantage KlenTaq PCR reaction buffer, 0.2 mM dNTP, 
35 250 nM of primers VEGF-E2 and VEGF-X14 and 1 pi of 

Advantage KlenTaq polymerase mix. Samples were heated 
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to 95°c for 30 s and cycling was done for 30 s at 
95°C and 30 s at 68°C fox 25, 30 or 35 cycles. 
Control reactions using specific primers that amplify 
a 1 kb fragment of the housekeeping gene G3PDH were 
also performed according to the manufacturer's 
instructions • 



Northern blot analysis of VEGF-X. 

Northern blots containing 2 *ig of poly (A) -rich RNA 
10 derived from different human tissues (Clontech 

Laboratories; MTN™ blot, MTN TW ' blot II and Cancer Cell 
Line MTN™ blot) were hybridized according to the 
manufacturers instructions with a a- [ 32 P] -dCTP 
random-priming labelled (Multiprime labelling kit, 
15 Roche Diagnostics) 293 bp specific VEGF-X fragment 
(PinAI-StuI fragment including 92 bp of the 3' end 
coding region and 201 bp of the 3' untranslated 
region of VEGF-X) . The blots were hybridized 
overnight at 68 °C and final washes at high stringency 
20 were at 66 °C in O.lx SSC/0.1 % SDS. The membranes 
were autoradiographed for 1 to 3 days with 
intensifying screens . 

Full length VEGF-X 

25 The effect of full length VEGF-X on proliferation of 
HuVEC cells was determined by the 3 H-Thymidine 
incorporation assay. HuVEC cells were serum starved 
for 24 hours prior to treatment with the full length 
VEGF-x at the concentration range from 100 pg/ml-10 

30 pg/ml. There was no effect of VEGF-X at 100 pg/ml-10 
ng/ml on endothelial cell proliferation. At the 
higher concentrations of FL -VEGF-X (100 ng/ml and 1 
pg/ml) there was a marked inhibition of endothelial 
cell proliferation. This is probably due to the very 
35 high endotoxin level in the samples . The VEGF-X 
sample was purified in order to decrease the 
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endotoxin level and is currently tested in the cell 
proliferation assay. 

The Summary from Testing the CUB Domain 
5 The effect of CUB domain on inhibition of HuVEC 

prolieration either serum- (2%), rh-VEGF or bFGF- 
stimulated, was assessed by the 3 H-Thymidine 
incorporation assay. Cells were serum starved 
followed by the treatment with the CUB domain and 

10 various growth factors. Results showed that the CUB 
domain inhibited endothelial cell proliferation, 
either serum- (2%), rh-VEGF or bFGF-stimulated in a. 
dose dependent manner with maximal inhibition at 10 
lig/ml. There was approximately a 2-fold inhibition 

15 of proliferation (at 10 yg/ml) of cells stimulated 

with VEGF and bFGF and nearly a 5-fold inhibition of 
cells stimulated with serum (2%). Results with the 
LDH assay showed that there was no cytotoxicity 
associated with the inhibition of cell proliferation 

20 by the CUB domain. 

Therefore, the N-terminus of the polypeptide from 
Figure 10 has been shown to possess a CUB domain. 
When database searches are carried out using the 

25 full-length coding sequence the best matches (i.e. 

for a BLAST search, those with the lowest probability 
score) are found with the CUB domain rather than with 
the VEGF-like domain. The best match from searching 
release 37 of the SWISSPROT database (Feb 1999) is to 

30 the CUB domain of a neuropilin from Xenopus laevis, 
and the matches to the CUB domains of human 
neuropilins 1 and 2 are also more significant than 
matches to the VEGFs - 

35 This similarity is provocative, given the 

identification of neuropilin-1 and -2 as cellular 
receptors for the VEGF- A 165 (Stoker et al. 1998, 
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reviewed in Neufeld et aJ. 1999). It is plausible 
therefore that VEGF-X could exert dual regulatory 
effects: via interaction with the tyrosine kinase 
VEGF-receptors mediated by the VEGF-like domain, as 
5 well as via interaction with VEGF isoforms or with 
the neurophilin receptors, mediated by the CUB 
domain. 

To the best of bur understanding the latter would be 
10 entirely novel, and searches on the most recent 
release of the Incyte database do not reveal any 
other proteins containing both CUB and VEGF-like 
domains. This arrangement of domains suggests 
possible positive or negative models of regulation: 

15 

Positive- the VEGF-like domain is able to interact 
productively with the tyrosine kinase VEGF receptors 
giving activation, and the CUB domain is able to 
interact productively with the neuropilin receptor 
20 giving activation. 

Negative- the VEGF-like domain does not interact 
productively with the tyrosine kinase VEGF receptors, 
either preventing receptor dimerisation or blocking 

25 the VEGF binding sites. Further, the CUB domain does 
not interact productively with the neuropilin 
receptors, either preventing receptor activation or 
blocking the VEGF binding sites, or indeed by binding 
to VEGF isoforms and preventing their interaction 

30 with receptors. 
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TABLE 1 



15 



ORIGINAL RESIDUE 


EXEMPLARY 


SUBSTITUTIONS 


ALA 


SER, 


THR 




ARG 


LYS 






ASN 


HIS, 


SER 




ASP 


GLU/ 


ASN 




CYS 


SER 






GLN 


ASN, 


HIS 




GLU 


ASP 


GLU 

\SJU\J 




GLY 


ALA 


o i_j r\ 




HIS 




GLN 




ILE 


LEU, 


VAL, 


THR 


LEU 


ILE, 


VAL 




LYS 


ARG, 


GLN/ 


GLU/ THR 


MET 


LEU , 


ILE, 


VAL 


PHE 


LEU/ 


TYR 




SER 


THR/ 


ALA, 


ASN 


THR 


SER, 


■ALA 




TRP 


ARG, 


SER 




TYR 


PHE 






VAL 


ILE/ 


LEU ALA 


PRO 


ALA 
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SEQUENCE LISTING 



Sequence ID No 1 



corresponds to the amino acid 
sequence from position 23 to 345 
of the amino acid sequence 
illustrated in Figure 10. 



10 



15 



20 



25 



30 



35 



Sequence ID No. 2 is the amino acid sequence 

illustrated in Figure 10. 



Sequence ID No 3 



Sequence ID No 4 



Sequence ID No 5 



Sequence ID No 6 



Sequence ID No 7 



Sequence ID No 8 



Sequence ID No 9 



corresponds to the sequence from 
position 257 to 1291 of the 
nucleotide sequence illustrated 
in Figure 9. 

corresponds to the polynucleotide 
sequence of VEGFX1 illustrated in 
Figure 3, 

corresponds to the polynucleotide 
sequence of VEGFX2 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX3 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX4 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX5 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX6 illustrated in 
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Sequence ID No 10 

5 

Sequence ID No 11 

10 

Sequence ID No 12 
15 Sequence ID No 13 
Sequence ID No 14 

20 

Sequence ID No 15 

25 

Sequence ID No 16 

30 

Sequence ID No 17 
35 Sequence ID No 18 



Figure 3 . 

corresponds to the polynucleotide 
sequence of VEGFX7 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX8 illustrated in 
Figure 3* 

corresponds to the polynucleotide 
sequence of VEGFX9 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX10 illustrated 
in Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFXll illustrated 
in Figure 4. 

corresponds to the polynucleotide 
sequence of VEGFX12 illustrated 
in Figure 4 . 

corresponds to the polynucleotide 
sequence of VEGFX13 illustrated 
in Figure 4 . 

corresponds to the polynucleotide 
sequence of VEGFX14 illustrated 
in Figure 4 . 

corresponds to the polynucleotide 
sequence 5'-l in Figure 8. 
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Sequence ID No 19 
Sequence ID No 20 

5 

Sequence ID No 21 

10 

Sequence ID No 22 

15 

Sequence ID No 23 
20 Sequence ID No 24 
Sequence ID No 25 

25 

Sequence ID No 26 

30 

Sequence ID No 27 

35 

Sequence ID No 28 
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corresponds to the polynucleotide 
sequence 5 , -2 in Figure 8. 

corresponds to the polynucleotide 
sequence of VEGFX6 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX7 illustrated in 
Figure 13 . 

corresponds to the polynucleotide 
sequence of VEGFX8 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX9 illustrated in 
Figure 13* 

corresponds to the polynucleotide 
sequence of VEGBAC1 illustrated 
in Figure 13. 

corresponds to the polynucleotide 
sequence of VEGBAC2 illustrated 
in Figure 13, 

corresponds to a polypeptide 
having the amino acid sequence 
from amino acid position 40 to 
150 of the sequence of Figure 10. 

corresponds to a polypeptide 
having the amino acid sequence 
illustrated in Figure 26. 

corresponds to the sequence from 
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5 Sequence ID No 29 



10 Sequence ID No 30 



position 5 to 508 of the 
nucleotide sequence illustrated 
in Figure 26. 

corresponds to the nucleotide 
sequence from position 5 to 508 
of the nucleotide sequence 
illustrated in Figure 26. 

corresponds to the sequence from 
position. 2-1 4 to 345 of the 
nucleotide sequence illustrated 
in Figure 10. 
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CIA IMS 



1. A nucleic acid molecule encoding a VEGF-X 
protein or a functional equivalent, derivative or 
5 bioprecursor thereof, said protein comprising any of 
the sequences from position 23 to 34 5 of the amino 
acid sequence illustrated in Figure 10, or the 
complete sequence as illustrated in Figure 10. 

10 2 - A nucleic acid molecule according to claim 1 
wherein said nucleic acid is a DNA molecule. 

3. A nucleic acid molecule according to claim 1 or 
2 wherein said nucleic acid is a cDNA molecule. 

15 

4. A nucleic acid molecule according to claim 3 
comprising the nucleotide sequence from position 257 
to 1291 of the nucleotide sequence illustrated in 
Figure 9, or sequences that hybridise thereto under 
high stringency conditions or the complement thereto. 

5. An antisense molecule capable of hybridising to 
a molecule according to any of claims 1 to 4 under 
high stringency conditions. 

6. A nucleic acid molecule according to any of 
claims 1 to 4 which is of mammalian origin. 

7. A nucleic acid molecule according to claim 6 
30 which is of human origin. 

8. An isolated VEGF-X protein, or a functional 
equivalent, derivative or bioprecursor thereof, 
having an amino acid sequence from position 23 to 345 

35 of the amino acid sequence illustrated in Figure 10 
or the complete amino acid sequence of Figure 10. 



20 



25 
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9. A VEGF-X protein, or a functional equivalent, 
derivative or bioprecusor thereof, encoded by a 
nucleic acid molecule as defined in any of claims 1 
to 4. 

5 

10. A protein according to claim 9, which comprises 
the amino acid sequence illustrated in Figure 10. 

11- An expression vector comprising a nucleic acid 
10 molecule according to any of claims 1 to 4. 

12, An expression vector according to claim 11 
further comprising a nucleotide sequence encoding a 
reporter molecule. 



15 



20 



25 



30 



35 



13* An expression vector comprising an antisense 
molecule according to claim S. 

14. A nucleic acid molecule according to any of 
claims 1 to 4 or an antisense molecule according to 
claim 5 for use as a medicament, 

15. A host cell transformed or transfected with an 
expression vector according to claim 11 or 12. 

16. A host cell transformed or transfected with an 
expression vector according to claim 13. 

17. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a VEGF-X protein 
according to claim 8 or 9. 

18. A transgenic cell, tissue or organism according 
to claim 17, wherein said transgene is included in an 
expression vector. 



19. A VEGF-X protein or a functional equivalent. 
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derivative or bioprecursor thereof, expressed- by a 
cell according to claim 15. 

20. A VEGF-X protein, or a functional equivalent, 
5 derivative or bioprecursor thereof, expressed by a 

transgenic cell, tissue or organism according to 
claim 17. 

21. A process for producing a VEGF-X protein 

10 according to any of claims 8 to 10, said process 

comprising transforming a host cell or organism with 
an expression vector according to claim 11, and 
recovering the expressed protein from said host cell 
or organism. 

15 

22. An antibody capable of binding to a protein 
according to any of claims 8 to 10, or an epitope 
thereof. 

20 23. An antibody according to claim 22 for use as a 
medicament. 

24. A pharmaceutical composition comprising an 
antibody according to claim 22 together with a 

IS pharmaceutical^ acceptable carrier diluent or 
excipient thereof. 

25. A method of identifying VEGF-X protein in a 
sample which method comprises contacting said sample 

0 with an antibody according to claim 22 and monitoring 
for binding of any protein to said antibody. 

26. A kit for identifying the presence of VEGF-X 
protein in a sample which comprises an antibody 

5 according to claim 22 and means for contacting said 
antibody with said sample. 



27. A method of identifying compounds which modulate 
angiogenesis which method comprises providing a host 
cell or organism according to claim 15 or a 
transgenic cell, tissue or organism according to 
claim 17, contacting a test compound with said cell, 
tissue or organism and monitoring for an effect of 
said compound on said VEGF compared to a host cell or 
organism according to claim 15 or a transgenic cell 
tissue or organism according to claim 17 which has 
not been contacted with said compound. 

28. A compound identifiable according to the method 
. of claim 27. 

29. A compound according to claim 28 for use as a 
medicament. 

30. A nucleic acid sequence comprising the 
nucleotide sequences illustrated in any of Figures 3, 
5, 8 or 13. 

31. A method for producing a polypeptide, said 
method comprising the steps of: 

a) culturing the host cell of claim 15 under 
conditions suitable for expression of the 
polypeptide; and 

b) recovering the polypeptide from the host 
cell culture. 

32. A method of inhibiting angiogenic activity and 
inappropriate vascularisation including formation and 
proliferation of new blood vessels, growth and 
development of tissues, tissue regeneration and organ - 
and tissue repair in a subject said method comprising 
administering to said subject an amount of an 
antisense molecule according to claim 5 in sufficient 
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concentration to reduce or prevent said angiogenic 
activity. 

33. A method of inhibiting angiogenic activity or 
5 inappropriate vascularisation including any of 

formation and proliferation of new blood vessels, 
growth arid development of tissues, tissue 
regeneration and organ and tissue repair in a subject 
said method comprising administering to said subject 
10 an amount of an antibody according to claim 22 in 
sufficient concentration to reduce or prevent said 
angiogenic activity or inappropriate vascularisation. 

34. A method of inhibiting angiogenic activity or 
15 inappropriate vascularisation including any of 

formation and proliferation of new blood vessels, 
growth and development of tissues, tissue 
regeneration and organ and tissue repair in a 
subject, said method comprising implanting in said 
20 subject cells that express an antibody according to 
claim 22. 

35. A method of treating or preventing any of 
cancer, rheumatoid arthritis, psoriasis and diabetic 

25 retinopathy, said method comprising administering to 
said subject an amount of an antisense molecule 
according to claim 5 in sufficient concentration to 
treat or prevent said disorders. 

30 36. A method of treating or preventing any of 

cancer, rheumatoid arthritis, psoriasis and diabetic 
retinopathy, said method comprising administering to 
said subject an amount of an antibody according to 
claim 22 in sufficient concentration to reduce or 

35 prevent said disorders. 

37 . A method of promoting angiogenic activity or 
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10 



vascularisation to promote wound healing, skin graft 
growth, tissue repair, proliferation of new blood 
vessels, tissue regeneration and organ repair which 
method comprises applying or delivering to a site of 
interest a therapeutically effective amount of any of 
a group selected from a protein according to claim 8 
and a nucleic acid molecule encoding a VEGF-X protein 
or a functional equivalent, derivative or 
bioprecursor thereof comprising an amino acid 
sequence illustrated in Figure 10, an expression 
vector comprising said nucleic acid molecule and a 
pharmaceutical composition comprising any of said 
nucleic acid molecule and said protein. 

15 38. A method of treating wounds selected from the 
group consisting of dermal ulcers, pressure sores, 
venous sores, diabetic ulcers and bums by applying 
to said wound a therapeutically effective amount of 
any of a VEGF-X protein according to claim 8, a 

20 pharmaceutical composition comprising said protein 

and a pharmaceutical^ acceptable carrier, diluent or 
excipient therefor. 

39. A nucleic acid molecule encoding a polypeptide 
25 having a CUB domain said polypeptide comprising the 
amino acid sequence from position 40 to 150 of the 
sequence of Figure 10 . 



30 



35 



40. A nucleic acid molecule encoding a polypeptide 
having a CUB domain, said polypeptide comprising the 
amino acid sequence of Figure 26. 

41. A nucleic acid molecule according to claim 39 or 
40, comprising the nucleotide sequence from position 
5 to 508 of the sequence illustrated in Figure 26. 

42. A nucleic acid molecule according to any of 
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claims 39 to 41 comprising the nucleotide sequence 
illustrated in Figure 26. 

43. A nucleic acid molecule encoding a VEGF like 

5 domain comprising the sequence from position 214-34 5 
of the sequence of Figure 10 or the sequence from 
position 15 to 4 61 illustrated in Figure 24. 

44. An expression vector comprising a nucleic acid 
10' molecule according to any of claims 39 to 42. 

45. An expression vector comprising a nucleic acid 
molecule according to claim 43. 

15 46. A host cell transformed or transfected with an 
expression vector according to claim 44. 

47. A host cell transformed or transfected with an 
expression vector according to claim 45. 

20 

48. A protein expressed by the cell according to 
claim 46. 

49. A protein expressed by the cell according to 
25 claim 47. 

50. A method of identifying compounds that inhibit 
or enhance angiogenic activity, said method 
comprising contacting a cell expressing a VEGF 

30 receptor and/or a neuropilin 1 or 2 type receptor 
with said compound in the presence of a VEGF-X 
protein according to claim 8 and monitoring for the 
effect of said compound or said cell when compared to 
a cell which has not been contacted with said 

35 compound. 

51 . A compound identifiable according to the method 
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of claim 50 as an inhibitor or enhancer of angiogenic 
activity. 

52. A method of inhibiting angiogenic activity or 
inappropriate vascularisation, said method comprising 
contacting a cell expressing, a VEGF receptor and a 
neuropilin type receptor with a protein selected from 
any of a protein according to any of claims 8 to 10 
and a protein according to claim 4 8 or a protein 
according to claim 49. 

53. Use of a nucleotide sequence illustrated in any 
of Figures 14 and 15 in identifying a VEGF-X protein 
according to claim 8. 

54. A nucleic acid molecule encoding a polypeptide 
comprising a CUB domain having the sequence from 
position 4 0 to 150 of the sequence of Figure 10 or 
from position 5 to 508 of the sequence of Figure 2 6 
and a sequence encoding a VEGF domain. 

55. A nucleic acid molecule according to claim 54 
wherein said sequence encoding said VEGF domain is 
selected from the sequences encoding any of VEGF A to 
D or isoforms or variants thereof. 

56. A nucleic acid molecule encoding a polypeptide 
comprising the amino acid sequence from position 40 
to 150 of the sequence illustrated in Figure 10 for 
use as a medicament. 

57. use of a nucleic acid molecule encoding a 
polypeptide having the amino acid sequence from 
position 4 0 to 150 of the sequence illustrated in 
Figure 10 in the manufacture of a medicament for 
treatment of disease conditions associated with 
inappropriate angiogenesis such as tumour or cancer 
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growth, retinopathy, osteoarthritis or psoriasis. 

58. A polypeptide comprising the amino acid sequence 
from position 40 to ISO of the sequence illustrated 
in figure 10 for use as a medicament. 

59. A polypeptide comprising the amino acid sequence 
from position 40 to 150 of the sequence illustrated 
in Figure 10 in the manufacture of a medicament for 
the treatment of disease conditions associated with 
inappropriate angiogenesis such as tumour growth, 
retinopathy, osteoarthritis or psoriasis. 

60. Use of a CUB domain comprising the amino acid 
sequence from position 40 to 150 of the sequence of 
Figure 10, or the amino acid sequence of Figure 26, 
to identify compounds which inhibit angiogenic 
activity in a method according to claim 50. 

61. A method of inhibiting angiogenic activity and 
inappropriate vascularisation including formation and 
proliferation of new blood vessels, growth and 
development of tissues, tissue regeneration and organ 
and tissue repair in a subject said method comprising 
administering to said subject an amount of a 
polypeptide having an amino acid sequence from 
position 40 to 150 of the sequence illustrated in 
Figure 10 or a nucleic acid molecule according to any 
of claims 39 to 42 in sufficient concentration to 
reduce or prevent said angiogenic activity. 

62. A method of treating or preventing any of 
cancer, rheumatoid arthritis, psoriasis and diabetic 
retinopathy, said method comprising administering to 
said subject an amount of a polypeptide having an 
amino acid sequence from position 4 0 to 150 of the 
sequence illustrated in Figure 10 or a nucleic acid 
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molecule according to any of claims 39 to 42 in 
sufficient concentration to treat or prevent said 
disorders . 

5 63. An antisense molecule capable of hybridising to 
a molecule according to any of claims 39 to 42 under- 
high stringency conditions. 

64. An antisense molecule capable of hybridising to 
10 a molecule according to claim 43 under high, 
stringency conditions. 

.65. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a protein according 
15 to claim 48. 

66. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a protein according 
to claim 49. 

20 

67* A transgenic, cell tissue or organism 
according to claim 65 or 66, wherein said transgene 
is included in an expression vector according to 
claim 41 or 42. 

25 

68. An antibody capable of binding to a protein 
according to claim 48 or an epitope thereof . 

69. An antibody capable of binding to a protein 
30 according to claim 4 9 or an epitope thereof. 

70. A pharmaceutical composition comprising an 
antibody according to claim 68 or 69 together with a 
pharmaceutical^ acceptable carrier diluent or 

35 excipient therefor. 



71. A pharmaceutical composition comprising a 
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compound according to claim 4 8 together with a 
pharmaceutical^ acceptable carrier, diluent or 
excipient therefor. 

72. A nucleic acid molecule encoding a variant of a 
VEGF-x protein having any of the sequences of 
nucleotides illustrated in Figure 12. 
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1 AAAATGTATG GATACAACTT AC GTTTG ATG AAAGATTTGG GCTTGAAGAC CCAGAAGATG 
TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC CGAACTTCTG GGTCTTCTAC 

61 ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG TGATGGAACT ATATTAGGGC 
TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC ACTACCTTGA TATAATCCCG 

121 GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC TAAAGGAAAT CAAATTAGGA 
CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG ATTTCCTTTA GTTTAATCCT 

+1 MetAsn IlePheLeu LeuAsnLeuLeu ThrGluGlu ValArgLeu 

] 

181 TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCTTC TAACAGAGGA GGTAAGATTA 
ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGAAG ATTGTCTCCT CCATTCTAAT 

+1 TyrSerCysThr ProArgAsn PheSerVal SerlleArgGlu GluLeuLys ArgThrAsp 



241 TACAGCTGCA CACCTCGTAA CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT 
ATGTCGACGT GTGGAGCATT GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA 

+1 ThrllePheTrp ProGlyCys LeuLeuVal LysArgCysGly GlyAsnCys AlaCysCys 



301 ACCATTTTCT GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT 
TGGTAAAAGA CCGGTCCAAC AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA 

+1 LeuHisAsnCys AsnGluCys GlnCysVal ProSerLysVal ThrLysLys TyrHisGlu 



3 61 CTCCACAATT GCAATGAATG TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG 
GAGGTGTTAA CGTTACTTAC AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC 

+1 ValLeuGlnLeu ArgProLys ThrGlyVal ArgGlyLeuHis LysSerLeu ThrAspVal 



421 GTCCTTCAGT TG AG AC C AAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG 
CAGGAAGTCA ACTC TGGTTT CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC 

+1 AlaLeuGluHis HisGluGlu CysAspCys ValCysArgGly SerThrGly Gly 



481 GCCCTGGAGC ACCATGAGGA GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCCG 

CGGGACCTCG TGGTACTCCT CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC 

541 CATCACCACC AGCAGCTCTT GCCCAGAGCT GTGCAGTGCA GTGGCTGATT CTATTAGAGA 

GTAGTGGTGG TCGTCGAGAA CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT 

601 ACGTATGCGT TATCTCCATC CTTAATCTCA GTTGTTTGCT TCAAGGACCT TTCATCTTCA 

TGCATACGCA ATAGAGGTAG GAATTAGAGT CAACAAACGA AGTTCCTGGA AAGTAGAAGT 

661 GGATTTACAG TGCATTCTGA AAGAGGAGAC ATCAAACAGA ATTAGGAGTT GTGGAACAGC 

CCTAAATGTC ACGTAAGACT TTCTCCTCTG TAGTTTGTCT TAATCCTCAA CACGTTGTCG 

721 TCTTTTGAGA GGAGGCCTAA AGGACAGGAG AAAAGGTCTT CAATCGTGGA AAGAAAATTA 

AGAAAACTCT CCTCCGGATT TCCTGTCCTC TTTTCCAGAA GTTAGCACCT TTCTTTTAAT 

781 AATGTTGTAT TAAATAGATC ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG 

TTACAACATA ATTTATCTAG TGGTCGATCA AAGTCTCAAT GGTACATGCA TAAGGTGATC 
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8 41 CTGGGTTCTG TATTTCAGTT CTTTCGATAC GGCTTAGGGT AATGTCAGTA CAGGAAAAAA 

GACCCAAGAC ATAAAGTCAA GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCC TTTTTT 

901 ACTGTGCAAG TGAGCACCTG ATTCCGTTGC CTTGCTTAAC TCTAAAGCTC CATGTCCTGG 
TGACACGTTC ACTCGTGGAC TAAGGCAACG GAACGAATTG AGATTTCGAG GTACAGGACC 

9 61 GCCTAAAATC GTATAAAATC TGGATTTTTT TTTTTTTTTT TGCTCATATT CACATATGTA 

CGGATTTTAG CATATTTTAG ACCTAAAAAA AAAAAAAAAA ACGAGTATAA GTGTATACAT 

1021 AACCAGAACA TTCTATGTAC TACAAACCTG GTTTTTAAAA AGGAACTATG TTGCTATGAA 
TTGGTCTTGT AAGATACATG ATGTTTGGAC CAAAAATTTT TCCTTGATAC AACGATACTT 

10 81 TTAAACTTGT GTCGTGCTGA TAGGACAGAC TGGATTTTTC ATATTTCTTA TTAAAATTTC 
AATTTGAACA CAGCACGACT ATCCTGTCTG ACCTAAAAAG TATAAAGAAT AATTTTAAAG 

1141 TGCCATTTAG AAGAAGAGAA C TACATTC AT GGTTTGGAAG AGATAAACCT GAAAAGAAGA 
ACGGTAAATC TTCTTCTCTT GATGTAAGTA CCAAACCTTC TCTATTTGGA CTTTTCTTCT 

12 01 GTGGCCTTAT CTTCACTTTA TCGATAAGTC AGTTTATTTG TTTCATTGTG TACATTTTTA 
CACCGGAATA GAAGTGAAAT AGCTATTCAG TCAAATAAAC AAAGTAACAC ATGTAAAAAT 

12 61 TATTCTCCTT TTGACATTAT AACTGTTGGC TTTTCTAATC TTGTTAAATA T ATC TATTTT 

ATAAGAGGAA AACTGTAATA TTGACAACCG AAAAGATTAG AACAATTTAT ATAGATAAAA 

1321 TACCAAAGGT ATTTAATATT CTTTTTTATG ACAACTTAGA TCAACTATTT TTAGCTTGGT 
ATGGTTTCCA TAAATTATAA GAAAAAATAC TGTTGAATCT AGTTGATAAA AATC GAACC A 

13 81 AAATTTTTCT AAACACAATT GTTATAGCCA GAGGAACAAA GATGATATAA AATATTGTTG 

TTTAAAAAGA TTTGTGTTAA CAATATCGGT CTCCTTGTTT CTACTATATT TTATAACAAC 

1441 CTCTGACAAA AATACATGTA TTTCATTCTC GTATGGTGCT AGAGTTAGAT TAATCTGCAT 
GAGACTGTTT TTATGTACAT AAAGTAAGAG CATACCACGA TCTCAATCTA ATTAGACGTA 

1501 TTTAAAAAAC TGAATTGGAA TAGAATTGGT AAGTTGCAAA GACTTTTTGA AAATAATTAA 
AAATTTTTTG ACTTAACCTT ATCTTAACCA TTCAACGTTT CTGAAAAACT TTTATTAATT 

1561 ATTATCATAT CTTCCATTCC TGTTATTGGA GATGAAAATA AAAAGCAACT TATGAAAGTA 
TAATAGTATA GAAGGTAAGG ACAATAACCT CTACTTTTAT TTTTCGTTGA ATACTTTCAT 

1621 GACATTCAGA TCCAGCCATT ACTAACCTAT TCCTTTTTTG GGGAAATCTG AGCCTAGCTC 
CTGTAAGTCT AGGTCGGTAA TGATTGGATA AGGAAAAAAC CCCTTTAGAC TCGGATCGAG 

1681 AGAAAAACAT AAAGCACCTT GAAAAAGACT TGGCAGCTTC CTGATAAAGC GTGCTGTGCT 
TCTTTTTGTA TTTCGTGGAA CTTTTTCTGA ACCGTCGAAG GACTATTTCG CACGACACGA 

1741 GTGCAGTAGG AACACATCCT ATTTATTG TG ATGTTGTGGT TTTATTATCT TAAACTCTGT 
CACGTCATCC TTGTGTAGGA TAAATAACAC TACAACACCA AAATAATAGA ATTTGAGACA 

1801 TCCATACACT TGTATAAATA CATGGATATT TTTATGTACA GAAGTATGTC TCTTAACCAG 
AGGTATGTGA ACATATTTAT GTACCTATAA AAATACATGT CTTCATACAG AGAATTGGTC 

18 61 TTCACTTATT GTACCTGG 
AAGTGAATAA CATGGACC 
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S^S^ ^ Predicted VEGF-like protein encoded by Incyte contig of 8/12/98 

1 MNIFLLNLLT ESVRLYSCTP RNFSVSIR2E LKRTDTIFWP GCLLVKRCCG 
51 NCACCLHNCN ECQCVPSKVT KKYH3VLQLR PKTGVRGLHX SLTDVALEHH 
101 EECDCVCRGS TGG 



y^P^£ PCR primers for cloning VEGF-X 

vegfXl AAAATGTATGQATACAACTTAC 

vegfX2 GTTTGATGAAAGATTTGGGCTTG 

vegfX3 TTTCTAAAGGAAATCAAATTAG 

vegfX4 GATAAGATTTGTATCTGATG 

vegfXS GATGTCTCCTCTTTCAG 

vegfX6 GCACAACTCCTAATTCTG 

vegfX7 AGCACCTGATTCCGTTGC 

vegfX8 TAGTACATAGAATGTTCTGG 

vegfX9 AAGAGACATACTTCTGTAC 

vegfX 1 0 C CAG GT AC AAT AAGTG AACTG 
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S^?**^ Variants isolated by PCR (at 8/2/99, all cloned and sequenced at 



JRF) 



a b c d e f 

PCR primers- — > -» «_ , . 



Incyte contig i l '^ Vv r 
(8/12/98) 



clone 22, 29, 41 



clone 52, 59 



clones 57, 25, 
26, 27 



2. 1kb clones 1, £ 
2,3 



3^ 



clone 15,20 | Y ^^V 



primers- a- v=gfX1 b- vegfX2 c- vegfX5 

(see fig 3) d- vegfXo e- vegfX9 f- vegfXI 0 
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/V^P.^S. VEGF-X 5' RACE primers 

vegfXl 1 CCTTTAGAAATCTGTTTTCCTGGTACAG 
vegfXl2 GGAAAATATTCATCAGATACAAATCTTATCC 
vegfX13 GGTCCAGTGGCAAAGCTGAAGG 
vegfXl4 CTGGTTCAAGATATCGAATAAGGTCTTCC 
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DNA sequence assembled from in-house clones and 5'RACE 



1 TGCCAGAGCA GGTGGGCGCT TCCACCCCAG TGCAGCCTTC CCCTGGCGGT GGTGAAAGAG 
ACGGTCTCGT CCACCCGCGA AGGTGGGGTC ACGTCGGAAG GGGACCGCCA CCACTTTCTC 

61 ACTCGGGAGT CGCTGCTTCC AAAGTGCCCG CCGTGAGTGA GCTCTCACCC CAGTCAGCCA 
TGAGCCCTCA GCGACGAAGG TTTCACGGGC ' GGCACTCACT CGAGAGTGGG GTCAGTCGGT 

+ 2 MetSerLeu PheGlyLeuLeu LeuLeuThr SerAlaLeu AlaGlyGlnArg GlnGlyTh 

] 

121 AATGAGCCTC TTCGGGC TTC TCCTGCTGAC ATCTGCCCTG GCCGGCCAGA GACAGGGGAC 
TTACTCGGAG AAGCCCGAAG AGGACGACTG TAGACGGGAC CGGCCGGTCT CTGTCCCCTG 

+2 rGlnAlaGlu SerAsnLeuSer SerLysPhe GlnPheSer SerAsnLysGlu GlnAsnGl 



181 TCAGGCGGAA TCCAACCTGA GTAGTAAATT CCAGTTTTCC AGCAACAAGG AACAGAACGG 
AGTCCGCCTT AGGTTGGACT CATCATTTAA GGTCAAAAGG TCGTTGTTCC TTGTCTTGCC 

+2 yValGlnAsp ProGlnHisGlu Argllelle ThrValSer ThrAsnGlySer IleHisSe 



241 AGT AC AAG AT CCTCAGCATG AGAGAATTAT TACTGTGTCT ACTAATGGAA GTATTCACAG 
TCATGTTCTA GGAGTCGTAC TCTCTTAATA ATGACAC AG A TGATTACCTT CATAAGTGTC 

+2 rProArgPhe ProHisThrTyr ProArgAsn ThrValLeu ValTrpArgLeu ValAlaVa 



301 CCCAAGGTTT CCTCATACTT ATCCAAGAAA TACGGTCTTG GTATGGAGAT TAGTAGCAGT 
GGGTTCCAAA GGAGTATGAA TAGGTTCTTT ATGCCAGAAC CATACCTCTA ATCATCGTCA 

+2 lGluGluAsn ValTrpIleGln LeuThrPhe AspGluArg PheGlyLeuGlu AspProGl 



3 61 AGAGGAAAAT GTATGGATAC AACTTACGTT TGATGAAAGA TTTGGGCTTG AAGACCCAGA 
TCTCC TTTTA CATACCTATG TTGAATGCAA ACTACTTTCT AAACCCGAAC TTCTGGGTCT 

+2 uAspAspIle CysLysTyrAsp PheValGlu ValGluGlu ProSerAspGly ThrlleLe 



421 AGATGACATA TGCAAGTATG ATTTTGTAGA AGTTGAGGAA CCCAGTGATG GAACTATATT 
TCTACTGTAT ACGTTCATAC TAAAACATCT TCAACTCCTT GGGTCACTAC CTTGATATAA 

+2 uGlyArgTrp CysGlySerGly ThrValPro GlyLysGln IleSerLysGly AsnGlnll 



4 81 AGGGCGCTGG TGTGGTTCTG GTACTGTACC AGGAAAACAG ATTTCTAAAG GAAATCAAAT 
TCCCGCGACC AC AC C AAG AC CATGACATGG TCCTTTTGTC TAAAGATTTC CTTTAGTTTA 

+2 eArglleArg PheValSerAsp GluTyrPhe ProSerGlu ProGlyPheCys IleHisTy 



541 TAGGATAAGA TTTGTATCTG ATGAATATTT TCCTTCTGAA CCAGGGTTCT GCATCCACTA 
ATCCTATTCT AAACATAGAC TACTTATAAA AGGAAGACTT GGTCCCAAGA CGTAGGTGAT 

+2 rAsnlleVal MetProGlnPhe ThrGluAla ValSerPro SerValLeuPro ProSerAl 



601 CAACATTGTC ATGCCACAAT TCACAGAAGC TGTGAGTCCT TCAGTGCTAC CCCCTTCAGC 
GTTGTAACAG TACGGTGTTA AGTGTCTTCG ACACTCAGGA AGTCACGATG GGGGAAGTCG 

+2 aLeuProLeu AspLeuLeuAsn AsnAlalle ThrAlaPhe SerThrLeuGlu AspLeuIl 



6 61 TTTGCCACTG GACCTGCTTA ATAATGCTAT AACTGCCTTT AGTACCTTGG AAGACCTTAT 
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+ 2 eArgTyrLeu GluProGluArg TrpGlnLeu AspLeuGlu AspLeuTyrArg ProThrTr 



721 TCGATATCTT GAACCAGAGA GATGGCAGTT GGACTTAGAA GATCTATATA GGCCAACTTG 
AGCTATAGAA CTTGGTCTCT CTACCGTCAA CCTGAATCTT C T AG ATAT AT CCGGTTGAAC 

+2 pGlnLeuLeu GlyLysAlaPhe ValPheGly ArgLysSer ArgValValAsp LeuAsnLe 



781 GCAACTTCTT GGCAAGGCTT TTGTTTTTGG AAGAAAATCC AGAGTGGTGG ATCTGAACCT 
CGTTGAAGAA CCGTTCCGAA AACAAAAACC TTCTTTTAGG TCTCACCACC TAGACTTGGA 

+ 2 uLeuThrGlu GluValArgLeu TyrSerCys ThrProArg AsnPheSerVal SerlleAr 



841 TCTAACAGAG GAGGTAAGAT TAT AC AG CTG CACACCTCGT AACTTCTCAG TGTCCATAAG 
AGATTGTCTC CTCCATTCTA ATATGTCGAC GTGTGGAGCA TTGAAGAGTC ACAGGTATTC 

+2 gGluGluLeu LysArgThrAsp ThrllePhe TrpProGly CysLeuLeuVal LysArgCy 



901 GGAAGAACTA AAGAGAACCG ATACCATTTT CTGGCCAGGT TGTCTCCTGG TTAAACGCTG 
CCTTCTTGAT TTCTCTTGGC TATGGTAAAA GACCGGTCCA ACAGAGGACC AATTTGC G AC 

+2 sGlyGlyAsn CysAlaCysCys LeuHisAsn CysAsnGlu CysGlnCysVal ProSerLy 



9 61 TGGTGGGAAC TGTGCCTGTT GTCTCCACAA TTGCAATGAA TGTCAATGTG TCCCAAGCAA 
ACCACCCTTG ACACGGACAA CAGAGGTGTT AACGTTACTT ACAGTTACAC AGGGTTCGTT 

+2 sValThrLys LysTyrHisGlu ValLeuGln LeuArgPro LysThrGlyVal ArgGlyLe 



1021 AGTTACTAAA AAATACCACG AGGTCCTTCA GTTGAGACCA AAG AC CGGTG TCAGGGGATT 
TCAATGATTT TTTATGGTGC TCCAGGAAGT CAACTCTGGT TTCTGGCCAC AGTCCCCTAA 



+2 uHisLysSer LeuThrAspVal AlaLeuGlu HisHisGlu GluCysAspCys ValCysAr 



1081 


GCACAAATCA 
CGTGTTTAGT 


CTCACCGACG 
GAGTGGCTGC 


TGGCCCTGGA 
ACCGGGACCT 


GCACCATGAG 
CGTGGTACTC 


GAGTGTGACT 
CTCACACTGA 


GTGTGTGCAG 
CACACACGTC 


+ 2 


gGlySerThr 


GlyGly 










1141 


AGGGAGCACA 
TCCCTCGTGT 


GGAGGATAGC 
CCTCCTATCG 


CGCATCACCA 
GCGTAGTGGT 


CCAGCAGCTC 
GGTCGTCGAG 


TTGCCCAGAG 
AACGGGTCTC 


CTGTGCAGTG 
GACACGTCAC 


1201 


CAGTGGCTGA 
GTCACCGACT 


TTCTATTAGA 
AAGATAATCT 


GAACGTATGC 
CTTGCATACG 


GTTATCTCCA 
CAATAGAGGT 


TCCTTAATCT 
AGGAATTAGA 


CAGTTGTTTG 
GTCAACAAAC 


1261 


CTTCAAGGAC 
GAAGTTCCTG 


CTTTCATCTT 
GAAAGTAGAA 


CAGGATTTAC 
GTCCTAAATG 


AGTGCATTCT 
TCACGTAAGA 


GAAAGAGGAG 
CTTTCTCCTC 


ACATCAAACA 
TGTAGTTTGT 


1321 


GAATTAGGAG 
CTTAATCCTC 


TTGTGCAACA 
AACACGTTGT 


GCTCTTTTGA 
CGAGAAAACT 


GAGGAGGCCT 
CTCCTCCGGA 


AAAGGACAGG 
TTTCCTGTCC 


AGAAAAGGTC 
TCTTTTC CAG 


1381 


TTCAATCGTG 
AAGTTAGCAC 


GAAAGAAAAT 
CTTTCTTTTA 


TAAATGTTGT 
ATTTACAACA 


ATTAAATAGA 
TAATTTATCT 


TCACCAGCTA 
AGTGGTCGAT 


GTTTCAGAGT 
CAAAGTCTCA 


1441 


TACCATGTAC 
ATGGTACATG 


GTATTCCACT 
CATAAGGTGA 


AGCTGGGTTC 
TCGACCCAAG 


TGTATTTCAG 
ACATAAAGTC 


TTCTTTCGAT 
AAGAAAGCTA 


ACGGCTTAGG 
TGCCGAATCC 


1501 


GTAATGTCAG 


TACAGGAAAA 


AAACTGTGCA 


AGTGAGCACC 


TGATTCCGTT 


GCCTTGCTTA 
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1561 ACTCTAAAGC TCCATGTCCT GGGCCTAAAA TCGTATAAAA TCTGGATTTT TTTTTTTTTT 
TGAGATTTCG AGGTACAGGA CCCGGATTTT AGCATATTTT AGACCTAAAA AAAAAAAAAA 

1621 TTTGCTCATA TTCACATATG TAAACCAGAA CATTCTATGT ACTACAAACC TGGTTTTTAA 
AAACGAGTAT AAGTG TATAC ATTTGGTCTT GTAAGATACA TGATGTTTGG ACCAAAAATT 

1681 AAAGGAACTA TGTTGCTATG AATTAAACTT GTGTCGTGCT GATAGGACAG ACTGGATTTT 
TTTCCTTGAT AC AAC GAT AC TTAATTTGAA CACAGCACGA CTATCCTGTC TGACCTAAAA 

17 41 TCATATTTCT TATTAAAATT TCTGCCATTT AGAAGAAGAG AACTACATTC ATGGTTTGGA 

AGTATAAAGA ATAATTTTAA AGACGGTAAA TCTTCTTCTC TTGATGTAAG TACCAAACCT 

18 01 AGAGATAAAC CTGAAAAGAA GAGTGGCCTT ATCTTC AC TT TATCGATAAG CCAGTTTATT 

TCTCTATTTG GACTTTTCTT CTCACCGGAA TAGAAGTGAA ATAGCTATTC GGTCAAATAA 

18 61 TGTTTCATTG TGTACATTTT TATATTCTCC TTTTGACATT ATAACTGTTG GCTTTTCTAA 

ACAAAGTAAC ACATGTAAAA ATATAAGAGG AAAACTGTAA TATTGACAAC CGAAAAGATT 

1921 TCTTGTTAAA TATATCTATT TTTACCAAAG GTATTTAATA TTCTTTTTTA TG AC AAC TT A 
AGAACAATTT ATATAGATAA AAATGGT TTC CATAAATTAT AAGAAAAAAT ACTGTTGAAT 

19 81 GATCAACTAT TTTTAGCTTG GTAAATTTTT CTAAACACAA TTGTTATAGC CAGAGGAACA 

CTAGTTGATA AAAATCGAAC CATTTAAAAA GATTTGTGTT AACAATATCG GTCTCCTTGT 

2041 AAGATGATAT AAAATATTGT TGCTCTGACA AAAATACATG TATTTCATTC TCGTATGGTG 
TTCT AC TATA TTTTATAACA ACGAGACTGT TTTTATGTAC ATAAAGTAAG AG C ATACCAC 

2101 CTAGAGTTAG ATTAATCTGC ATTTTAAAAA AC TGAATTGG AATAGAATTG GTAAGTTGCA 
GATCTCAATC TAATTAGACG TAAAATTTTT TGACTTAACC TTATCTTAAC CATTCAACGT 

2161 AAGACTTTTT GAAAATAATT AAATTATCAT ATCTTCCATT CCTGTTATTG GAGATGAAAA 
TTCTGAAAAA CTTTTATTAA TTTAATAGTA TAGAAGGTAA GGACAATAAC CTCTACTTTT 

2221 TAAAAAGCAA CTTATGAAAG TAGACATTCA GATCCAGCCA TTACTAACCT ATTCCTTTTT 
ATTTTTCGTT GAATACTTTC ATCTGTAAGT CTAGGTCGGT AATGATTGGA TAAGGAAAAA 

22 81 TGGGGAAATC TGAGCCTAGC TCAGAAAAAC ATAAAGCACC TTGAAAAAGA CTTGGCAGCT 

ACCCCTTTAG ACTCGGATCG AGTCTTTTTG TATTTCGTGG AACTTTTTCT GAACCGTCGA 

23 41 TCCTGATAAA GCGTGCTGTG CTGTGCAGTA GGAACACATC CTATTTATTG TGATGTTGTG 

AGGACTATTT CGCACGACAC GACACGTCAT CCTTGTGTAG GATAAATAAC ACTACAACAC 

2 401 GTTTTATTAT CTTAAACTCT GTTCCATACA CTTGTATAAA TACATGGATA TTTTTATGTA 
CAAAATAATA GAATTTGAGA CAAGGTATGT GAACATATTT ATGTACCTAT AAAAATACAT 

2461 CAGAAGTATG TCTCT 
GTCTTCATAC AGAGA 
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New Sequence + Incyte ESTs 



1 ATTTGTTTAA ACCTTGGGAA ACTGGTTCAG GTCCAGGTTT TGCTTTGATC CTTTTCAAAA 
TAAACAAATT TGGAACCCTT TGACCAAGTC CAGGTCCAAA ACGAAACTAG GAAAAGTTTT 

61 ACTGGAGACA CAGAAGAGGG CTTCTAGGAA AAAGTTTTGG GATGGGATTA TGTGGAAACT 
TGACCTCTGT GTCTTCTCCC GAAGATCCTT TTTCAAAACC CTACCCTAAT ACACCTTTGA 

121 ACCCTGCGAT TCTCTGCTGC CAGAGCAGGC TCGGCGCTTC CACCCCAGTG CAGCCTTCCC 
TGGGACGCTA AGAGACGACG GTCTCGTCCG AGCCGCGAAG GTGGGGTCAC GTCGGAAGGG 

181 CTGGCGGTGG TGAAAGAGAC TCGGGAGTCG CTGCTTCCAA AGTGCCCGCC GTGAGTGAGC 
GACCGCCACC ACTTTCTCTG AGCCCTCAGC GACGAAGGTT TCACGGGCGG CACTCACTCG 

+2 Met SerLeuPhe GlyLeuLeu LeuLeuThrSer AlaLeuAl 

] _ 

241 TCTCACCCCA GTCAGCCAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC 
AG AGTGGGG T CAGTCGGTTT ACTCGGAGAA GCCCGAAGAG GACGACTGTA GACGGGACCG 

+2 aGlyGlnArg GlnGlyThrGln AlaGluSer AsnLeuSer SerLysPheGln PheSerSe 

3 01 CGGCCAGAGA CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTCCAG 
GCCGGTCTCT GTCCCCTGAG TCCGCCTTAG GTTGGACTCA TCATTTAAGG TCAAAAGGTC 

+2 rAsnLysGlu GlnTyrGlyVal GlnAspPro GlnHisGlu ArgllelleThr ValSerTh 



3 61 CAACAAGGAA CAGTACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTGTGTCTAC 
GTTGTTCCTT GTCATGCCTC ATGTTCTAGG AGTCGTACTC TCTTAATAAT GACACAGATG 

+2 rAsnGlySer IleHisSerPro ArgPhePro HisThrTyr ProArgAsnThr ValLeuVa 



421 TAATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA CGGTCTTGGT 
ATTACCTTCA TAAGTGTCGG GTTCCAAAGG AGTATGAATA GGTTCTTTAT GCCAGAACCA 

+2 lTrpArgLeu ValAlaValGlu GluAsnVal TrpIleGln LeuThrPheAsp GluArgPh 



481 ATGGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT 
TACCTCTAAT CATCGTCATC TCCTTTTACA TACCTATGTT GAATGCAAAC TACTTTCTAA 

+2 eGlyLeuGlu AspProGluAsp AspIleCys LysTyrAsp PheValGluVal GluGluPr 



541 TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC 
ACCCGAACTT CTGGGTCTTC TACTGTATAC GTTCATACTA AAACATCTTC AACTCCTTGG 

+2 oSerAspGly ThrlleLeuGly ArgTrpCys GlySerGly ThrValProGly LysGlnll 



601 CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT 
GTCACTACCT TGATATAATC CCGCGACCAC ACCAAGACCA TGACATGGTC CTTTTGTCTA 

+2 eSerLysGly AsnGlnlleArg IleArgPhe ValSerAsp GluTyrPhePro SerGluPr 



661 TTCTAAAGGA AATCAAATTA GGATAAGATT TGTATC TG AT GAATATTTTC CTTCTGAACC 
AAGATTTCCT TTAGTTTAAT CCTATTCTAA ACATAGACTA CTTATAAAAG GAAGACTTGG 
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+2 oGlyPheCys IleHisTyrAsn IleValMet: ProGlnPhe ThrGluAlaVal SerProSe 



721 AGGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGAGTCCTTC 
TCCCAAGACG TAGGTGATGT TGTAACAGTA CGGTGTTAAG TGTCTTCGAC ACTCAGGAAG 

+2 rValLeuPro ProSerAlaLeu ProLeuAsp LeuLeuAsn AsnAlalleThr AlaPheSe 

781 AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA CTGCCTTTAG 
TCACGATGGG GGAAGTCGAA ACGGTGACCT GGACGAATTA TTACGATATT GACGGAAATC 

+2 rThrLeuGlu AspLeuIleArg TyrLeuGlu ProGluArg TrpGlnLeuAsp LeuGluAs 

841 TACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG ACTTAGAAGA 
ATGGAACCTT CTGGAATAAG CTATAGAACT TGGTCTCTCT ACCGTCAACC TGAATCTTCT 

+2 pLeuTyrArg ProThrTrpGln LeuLeuGly LysAlaPhe ValPheGlyArg LysSerAr 



901 TC T AT AT AGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG 
AGATATATCC GGTTGAACCG TTGAAGAACC GTTCCGAAAA CAAAAACCTT CTTTTAGGTC 

+2 gValValAsp LeuAsnLeuLeu ThrGluGlu ValArgLeu TyrSerCysThr ProArgAs 



961 AGTGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA 
TCACCACCTA GACTTGGAAG ATTGTCTCCT CCATTCTAAT ATGTCGACGT GTGGAGCATT 

+2 nPheSerVal SerlleArgGlu GluLeuLys ArgThxAsp ThrllePheTrp ProGlyCy 



1021 CTTCTCAGTG TC C AT AAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG 
GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA TGGTAAAAGA CCGGTCCAAC 

+2 sLeuLeuVal LysArgCysGly GlyAsnCys AlaCysCys LeuHisAsnCys AsnGluCy 



1081 TCTC CTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT GCAATGAATG 
AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA GAGGTGTTAA CGTTACTTAC 

+2 sGlnCysVal ProSerLysVal ThrLysLys TyrHisGlu ValLeuGInLeu ArgProLy 



1141 TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT TGAG AC C AAA 
AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC CAGGAAGTCA ACTCTGGTTT 

+2 sThrGlyVal ArgGlyLeuHis LysSerLeu ThrAspVal AlaLeuGluHis HisGluGl 



1201 GACCGGTGTC AGGGGATTGC ACAAATCACT . CACCGACGTG GCCCTGGAGC ACCATGAGGA 
CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC CGGGACCTCG TGGTACTCCT 

+2 uCysAspCys ValCysArgGly SerThrGly Gly 

> 

1261 GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCCG CATCACCACC AGCAGCTCTT 
CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC GTAGTGGTGG TCGTCGAGAA 

1321 GCCCAGAGCT GTGCAGTGCA GTGGCTGATT CTATTAGAGA ACGTATGCGT TATCTCCATC 
CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT TGCATACGCA ATAGAGGTAG 

13 81 CTTAATCTC A GTTGTTTGCT TCAAGGACCT TTCATCTTCA GGATTTACAG TGCATTCTGA 
GAATTAGAGT CAACAAACGA AGTTCCTGGA AAGTAGAAGT CCTAAATGTC ACGTAAGACT 



SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 



PCT/US99/30503 



11 / 54 

1441 AAGAGGAGAC ATCAAACAGA ATTAGGAGTT GTGCAACAGC TCTTTTGAGA GGAGGC CTAA 
TTCTCCTCTG TAGTTTGTCT TAATCCTCAA CACGTTGTCG AGAAAACTCT CCTCCGGATT 

1501 AGGACAGGAG AAAAGGTCTT CAATCGTGGA AAGAAAATTA AATGTTGTAT TAAATAGATC 
TCCTGTCCTC TTTTCCAGAA GTTAGCACCT TTCTTTTAAT TTACAACATA ATTT ATC TAG 

1561 ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG CTGGGTTCTG TATTTCAGTT 
TGGTCGATCA AAGTCTCAAT GGTACATGCA TAAGGTGATC GACCCAAGAC ATAAAGTCAA 

1621 CTTTCGATAC GGCTTAGGGT AATGTCAGTA CAGGAAAAAA ACTGTGCAAG TGAGCACCTG 
GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCCTTTTTT TGACACGTTC ACTCGTGGAC 

16 81 ATTCCGTTGC CTTGGCTTAA CTCTAAAGCT CCATGTCCTG GGCCTAAAAT CGTATAAAAT 

TAAGGCAACG GAACCGAATT GAGATTTCGA GGTACAGGAC CCGGATTTTA GCATATTTTA 

17 41 CTGGATTTTT TTTTTTTTTT TTGCGCATAT TCACATATGT AAACCAGAAC ATTCTATGTA 

GACCTAAAAA AAAAAAAAAA AACGCGTATA AGTGTATACA TTTGGTCTTG TAAGATACAT 

1801 CTACAAACCT GGTTTTTAAA AAGGAAC TAT GTTGCTATGA ATTAAACTTG TGTCATGCTG 
GATGTTTGGA CCAAAAATTT TTCCTTGATA CAACGATACT TAATTTGAAC ACAGTACGAC 

18 61 ATAGGACAGA CTGGATTTTT CATATTTCTT ATTAAAATTT CTGCCATTTA GAAGAAGAGA 

TATCCTGTCT GACCTAAAAA GTATAAAGAA TAATTTTAAA GACGGTAAAT CTTCTTCTCT 

1921 ACTACATTCA TGGTTTGGAA GAGATAAACC TGAAAAGAAG AGTGGCCTTA TCTTCACTTT 
TGATGTAAGT ACCAAACCTT CTCTATTTGG ACTTTTCTTC TCACCGGAAT AGAAGTGAAA 

19 81 ATCGATAAGT CAGTTTATTT GTTTCATTGT GTACATTTTT ATATTCTCCT TTTGACATTA 

TAGCTATTCA GTCAAATAAA CAAAGTAACA CATGTAAAAA TATAAGAGGA AAACTGTAAT 

2 041 TAACTGTTGG CTTTTCTAAT CTTGT TAAAT ATATCTATTT TTACCAAAGG TATTTAATAT 
ATTGACAACC GAAAAGATTA GAACAATTTA TATAGATAAA AATGGTTTCC ATAAATTATA 

2101 TCTTTTTTAT GACAACTTAG ATCAACTATT TTTAGCTTGG TAAATTTTTC TAAACACAAT 
AGAAAAAATA CTGTTGAATC TAGTTGATAA AAATCGAACC ATTTAAAAAG ATTTGTGTTA 

2161 TGTTATAGCC AGAGGAACAA AGATGATATA AAATATTGTT GCTCTGACAA AAATACATGT 
ACAATATCGG TCTCCTTGTT TCTACTATAT TTTATAACAA CGAGACTGTT TTTATGTACA 

2221 ATTTCATTCT CGTATGGTGC TAGAGTTAGA TTAATCTGCA TTTTAAAAAA CTGAATTGGA 
TAAAGTAAGA GCATACCACG ATCTCAATCT AATTAGACGT AAAATTTTTT GACTTAACCT 

2 2 81 ATAGAATTGG TAAGTTGCAA AG AC TTTTTG AAAATAATTA AATTATCATA TCTTCCATTC 
TATCTTAACC ATTCAACGTT TCTGAAAAAC TTTTATTAAT TTAATAGTAT AGAAGGTAAG 

2 341 CTGTTATTGG AGATGAAAAT AAAAAGCAAC TTATGAAAGT AGACATTCAG ATCCAGCCAT 
GACAATAACC TCTACTTTTA TTTTTCGTTG AATACTTTCA TCTGTAAGTC TAGGTCGGTA 

2 401 TACTAACCTA TTCCTTTTTT GGGGAAATCT GAGCCTAGCT CAGAAAAACA TAAAGCACCT 
ATGATTGGAT AAGGAAAAAA CCCCTTTAGA CTCGGATCGA GTCTTTTTGT ATTTCGTGGA 

2 461 TGAAAAAGAC TTGGCAGCTT CCTGATAAAG CGTGCTGTGC TGTGCAGTAG GAACACATCC 
ACTTTTTCTG AACCGTCGAA GGACTATTTC GCACGACACG ACACGTCATC CTTGTGTAGG 

2 521 TATTTATTGT GATGTTGTGG TTTTATTATC TTAAACTCTG TTCCATACAC TTGTATAAAT 
ATAAATAACA CTACAACACC AAAATAATAG AATTTGAGAC AAGGTATGTG AACATATTTA 
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2581 ACATGGATAT TTTTATGTAC AGAAGTATGT CTCTTAACCA GTTCACTTAT TGTACTCTGG 

TGTACCTATA AAAATACATG TCTTCATACA GAGAATTGGT CAAGTGAATA ACATGAGACC 

2641 CAATTTAAAA GAAAATCAGT AAAATATTTT GCTTGTAAAA TGCTTAATAT CGTGCCTAGG 

GTTAAATTTT CTTTTAGTCA TTTTATAAAA CGAACATTTT ACGAATTATA GCACGGATCC 

27 01 TTATGTGGTG ACTATTTGAA TCAAAAATGT ATTGAATCAT CAAATAAAAG AATGTGGCTA 

AATACACCAC TGATAAACTT AGTTTTTACA TAACTTAGTA GTTTATTTTC TTACACCGAT 

2761 TTTTGGGGAG AAAATT 
AAAACCCCTC TTTTAA 



Additional oligonucleotides used for amplification of entire 




coding region 



5'-1 



TTTGTTTAAACCTTGGGAAACTGG 



5*-2 



GTCCAGGTTTTGCTTTGATCC 
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f- SCr. ^ DNA Sequence Of Clones 4 & 7, Identical Clones Containing The 
Entire Open Reading Frame 

1 TTTGTTTAAA CCTTGGGAAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC TTTTCAAAAA 

AAACAAATTT GGAACCCTTT GACCAAGTCC AGGTCCAAAA CGAAACTAGG AAAAGTTTTT 

61 CTGGAGACAC AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGGATTATGT GGAAACTACC 

GACCTCTGTG TCTTCTCCCG AGATCCTTTT TCAAAACCTA CCCTAATACA CCTTTGATGG 

121 CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG CCTTCCCCTG 

GACGCTAAGA GACGACGGTC TCGTCCGAGC CGCGAAGGTG GGGTCACGTC GGAAGGGGAC 

181 GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG AGTGAGCTCT 

CGCCACCACT TTCTCTGAGC CCTCAGCGAC GAAGGTTTCA CGGGCGGCAC TCACTCGAGA 

+2 MetSer LeuPheGly LeuLeuLeu LeuThrSerAla LeuAlaGl 

] _ 

2 41 CACCCCAGTC AGCCAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG CCCTGGCCGG 

GTGGGGTCAG TCGGTTTACT CGGAGAAGCC CGAAGAGGAC GAC TGTAGAC GGGACCGGCC 

+2 yGlnArgGln GlyThrGlnAla GluSerAsn LeuSerSer LysPheGlnPhe SerSerAs 



3 01 C C AGAG AC AG GGGACTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT TTTCCAGCAA 
GGTCTCTGTC CCCTGAGTCC GCCTTAGGTT GGACTCATCA TTTAAGGTCA AAAGGTCGTT 

+2 nLysGluGln AsnGlyValGln AspProGln HisGluArg IlelleThrVal SerThrAs 

3 61 CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG TGTCTACTAA 
GTTCCTTGTC TTGCCTCATG TTCTAGGAGT CGTACTCTCT TAATAATGAC ACAGATGATT 

+2 nGlySerlle HisSerProArg PheProHis ThrTyrPro ArgAsnThrVal LeuValTr 



421 TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG TCTTGGTATG 
ACCTTCATAA GTGTCGGGTT CCAAAGGAGT ATGAATAGGT TCTTTATGCC AGAACCATAC 

+ 2 pArgLeuVal AlaValGluGlu AsnValTrp IleGlnLeu ThrPheAspGlu ArgPheGl 



4 81 GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG 
CTCTAATCAT CGTCATCTCC TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC 

+2 yLeuGluAsp ProGluAspAsp IleCysLys TyrAspPhe ValGluValGlu GluProSe 



541 GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG 
CGAACTTCTG GGTCTTCTAC TGTATACGTT CATACTAAAA CATC TTC AAC TCCTTGGGTC 

+2 rAspGlyThr IleLeuGlyArg TrpCysGly SerGlyThr ValProGlyLys GlnlleSe 



601 TGATGGAACT ATATTAGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC 
ACTACCTTGA TATAATCCCG CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG 

■t-2 rLysGlyAsn GlnlleArglle ArgPheVal SerAspGlu TyrPheProSer GluProGl 



6 61 TAAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCAGG 
ATTTCCTTTA GTTTAATCCT ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGTCC 
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+2 yPheCysIle HisTyrAsnlle ValMetPro GlnPheThr GluAlaValSer ProSerVa 



721 GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGC TGTGA GTCCTTCAGT 
CAAGACGTAG GTGATGTTGT AACAGTACGG TGTTAAGTGT CTTCGACACT CAGGAAGTCA 

+2 lLeuProPro SerAlaLeuPro LeuAspLeu LeuAsnAsn AlalleThrAla PheSerTh 



7 81 GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG CCTTTAGTAC 
CGATGGGGGA AGTCGAAACG GTGACCTGGA CGAATTATTA CGATATTGAC GGAAATCATG 

+2 rLeuGluAsp LeuIleArgTyr LeuGluPro GluArgTrp GlnLeuAspLeu GluAspLe 



841 CTTGGAAGAC CTTATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT TAGAAGATCT 
GAACCTTCTG GAATAAGCTA TAGAACTTGG TCTCTCTACC GTCAACCTGA ATCTTCTAGA 

' +2 uTyrArgPro' ThrTrpGlnLeu LeuGlyLys AlaPheVal PheGlyArgLys SerArgVa 

9 01 ATATAGGCCA ACTTGGCAAC TTCTTGGCAA GGCTTTTGTT TTTGGAAGAA AATCCAGAGT 
TATATCCGGT TGAACCGTTG AAGAACCGTT CCGAAAACAA AAACCTTCTT TTAGGTCTCA 

+2 lValAspLeu AsnLeuLeuThr GluGluVal ArgLeuTyr SerCysThrPro ArgAsnPh 



9 61 GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC CTCGTAACTT 
CCACCTAGAC TTGGAAGATT GTCTCCTCCA TTCTAATATG TCGACGTGTG GAGCATTGAA 

+2 eSerValSer IleArgGluGlu LeuLysArg ThrAspThr IlePheTrpPro GlyCysLe 



1021 CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC CAGGTTGTCT 
GAGTCACAGG TATTCCCTTC TTGATTTCTC TTGGCTATGG TAAAAGACCG GTCCAACAGA 

+ 2 uLeuValLys ArgCysGlyGly AsnCysAla CysCysLeu HisAsnCysAsn GluCysGl 



1081 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA ATGAATGTCA 
GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT TACTTACAGT 

+ 2 nCysValPro SerLysValThr LysLysTyr HisGluVal LeuGlnLeuArg ProLysTh 

1141 ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC 
TACACAGGGT TCGTTTCAAT GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG 

+2 rGlyValArg GlyLeuHisLys SerLeuThr AspValAla LeuGluHisHis GluGluCy 



1201 CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC ATGAGGAGTG 
GCCACAGTCC CCTAACGTGT TTAGTGAGTG GCTGCACCGG GACCTCGTGG TACTCCTCAC 

+ 2 sAspCysVal CysArgGlySer ThrGlyGly 

> 

1261 TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC AGCTCTTGCC 
ACTGACACAC ACGTCTCCCT CGTGTCCTCC TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

1321 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT CTCCATCCTT 
GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA GAGGTAGGAA 

13 81 AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG 
TT AG AG TC AA CAAACGAAGT TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC 

1441 AGGAGACATC AAACAGAATT AGGAGTTGTG CAA 
TCCTCTGTAG TTTGTCTTAA TCCTCAACAC GTT 
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Predicted FuINength Polypeptide Sequence 

1 MSLFGLLLLT SALAGQRQGT QAESNLSSXF QFSSNKEQYG VQDPQHERII 

51 TVSTNGSIHS PRFPHTYFRN TVLVWRLVAV EENVWIQLTF DERFGLEDPE 

101 DDICKYDFVE VEEPSDGTIL GRWCGSGTVP GKQISKGNQI RIRFVSDEYF 

151 P5SPGFCIHY NIVMPQFTEA VSPSVLPPSA LPLDLLNNAI TAFSTLEDLI 

201 RYLEPERWQL DLEDLYRPTW Q L LGXAFVFG RXSRWDLNL LTEEVRLYSC 

251 TPRNFSVSIR EELKRTDTIF WPGCLLVKRC GGNCACCLHN CNECQCVPSK 

3 01 VTKXYHEVLQ LRP XTGVRG L, HKSLTDVALE HHEECDCVCR GSTGG 
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Alignment of VEGF-X with Other VEGFs 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC„HUMAN 
VEGD_HUMAN 
990126vegx 



20 



40 



MSLFGLLLLTSALAGQRQGTQAESNLS SKFQF S SNKEQNGVQDPQHERI I 



50 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



60 



80 



100 



TVSTNG S I HS PRF PHTYPRNTVLVWRLVAVEENVWI QLTFDERFGLEDPE 



100 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



120 



140 



MHLLGFFSVACSLLAAALLPGPREAPAAAA 

MYREWVWNV 

DDICKYDFVEV — EEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDE 



30 
10 
148 



160 



180 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99 012 6vegx 



200 
--MN 
— MP 



AFESGLDLSDAEPDAGEATAYASKDLEEQLRSVSSVDELMTVLYPEYWKM 
FMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSE 
YFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLLNNAITAFSTLED 



2 
2 

80 
60 
198 



220 



240 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_ HUMAN 
990126vegx 



vmrlfpcflqllaglalpavppqqwalsagngssevewpfqe-vwi 

mspllrrlllaallqlapaqapvsqpdapghqrkvvswid-vytgat 

ykcqlrkggwqhnreqanlnsrteetikfaaahynteilksidne1 
dwklwrcrlrlksftsmdsrsashrstrfaatfydietlkvideewq 
liryleperwqldledlyrptwqllgkafvfgrksrwdlnllte; 



* 




3sy • 


51 


SSY 


51 




46 


Stq 


13 0 


htq 


110 


Sly 


: 248 



vegf__human 
plgf_human 
vegb_human 
vegc_human 
vegd_human 

99012 6vegx 



260 * 

chpietlvdifqHypdeieyifl 
cralerlvdws Sypsevehmfs 
cqprevwpltv : lmgtvakql 
CMPREVCIDVGK a fgvatntff: 
CSPRETCVEVAS gLGKSTNTF 
S CTPRNF S VS I R §ELKRTDT I F 



280 




300 

--egl: 

|GD--ENL] 

pd — dgl: 

S--EGD 
E--ESLI 
ILHNCNECQBESP 




96 
96 
91 
175 
155 
298 
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VEGF_HUMAN 
PLGF__HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



* 320 
TEESNITMQHmrHkPHQG - QHIGBgjS FLQ 

vetanvtmqHlk rsgdr psyveHtfsq 

TGQHQVRMQMLM j RYPS . — SQLGeSsLE; 

tstsylsktHfe tvplsqg — pkpvtHsf; 

TSTSYISKQWFE I SVPLTSV PELVpSBkVj 

skvtkkyheBlqIIrpktgvrglhksltdSSale: 



340 




RPKKDRARQEK : 141 

RPLREKMKPER : 141 

IRPKKKDSAVKP : 13 5 

SKLDVYRQVH : 222 

LPTAPRHP YS I : 202 

GRGSTGG : 3 45 



360 * 380 * . 400 

VEGF_HUMAN : KSVRGKGKGQKRKRKKSRYKSWSVP 

PLGF_HUMAN : 

VEGB_HUMAN : DSPR 

VEGC_HUMAN ; SIIRRSLPATLPQCQAANKTCPTNYMWNNHICRCLAQEDFMFSSDAGDDS 

VEGD_HUMAN : IRRS I Q I PEEDRC SHSKKLC P I DMLWDSNKCKC VLQEENPLAGT 

990126vegx : 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_ HUMAN 
99012 6 vegx 



* 420 * 440 * 

TDGFHDICGPNKELDEETCQCVCRAGLRPASCGPHKEIiDRNSCQCVCKNK 
EDHSHLQEPALCGP 



322 
260 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99 012 6 vegx 



460 * 480 * 500 

CGPCSERRKHLFVQDPQTCKC-SCKNTDSRCKARQLELNER : 2 06 

— CGDAVPRR-. : 149 

- PXiCPRCTQHHQRP.DPRTCRCRCRRRSFIiRCQGRGLELNPD : 17 9 

LFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCACECTESPQKCLLK : 372 

HMMFDEDRCECVCKTPC PKDLI QHPKNC SC FECKES LETCCQKHKLFHPD : 310 



VEGF_HUMAN 
PLGF__HUMAN 
VEGB_HUMAN 
VEGC__HUMAN 
VEGD_HUMAN 
990126vegx 



520 



TCRCDKPRR- 



540 



TCRCRKLRR ■ 

GKKFHHQTCSCYRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS 

TCSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP 



215 

188 
419 
354 
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FL_seq 
clone41 
clone2 0 



Variant Polypeptide Sequences 

20 * 40 



MSLFGLL.LL.TSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 
MSLFGLIiLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 
ISLFGLLLLTSALAGORQGTOAESNLSSKFOFSSNKEONGVQDFQHERII 



50 
50 
50 



FL_seq 
clone41 
clone2 0 



60 



80 



100 



TVSTNGSIHSPRFPHTYPRNTVLVWRLVAVEENVIVIQLTFDERFGLEDPE 
TVSTKGSIHSPRFPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPE 
WSTKGSIHSPRFPHTYPRNTVIiVVJRLVAVEEISJVl^IOLTFDERFGLEDPE 



100 
100 
100 



FL_seq 

clone41 

clone20 



120 



140 



DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 
DDICKYDFVEVEEPSDGTILGRVJCGSGTVPGKQISKGNQIRIRFVSDEYF 
DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 



150 
150 
150 



FL_seq 

clone41 

clone20 



PSEP 
PSEP 
PSEP 




* 180 * 200 

jPQFTEAVSPSVLPFSALPLDLLNNAITAFSTLEDLI : 200 

[HTS : 167 

PQFTEAVSPSVLFFSALPLDLLNNAITAFSTLEDLI : 20 0 



FL_seq 
clone41 
clone2 0 



* 220 * 240 * 

RYX*EPERWQIjDI^DLYRPTWQLLGKAFVFGRKSRVVDIjNLLTEEVRLiYSC 



RYLEPERWQLDIjEDLYRPTWQ LiLGKAFVFGRKSRWDLNIjIiTE - 



250 
243 



FL_seq 

clone41 

clone20 



260 * 280 * 300 

TPRNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLiHNCNECQCVPSK 



300 



FL_seq 
clone41 
clone2 0 



* 320 * 340 

VTKKYHEVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 

EVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 



345 
282 
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Primers for Expression of VEGF-X 

E.coli expression of domain- 

vegx-6 AATTGGATCCQAGAGTGQTQGATCTQAACC 

vegx-7 AATTGGATCCGGGAAGAAAATCCAGAGTGG 

vegx-8 GGTTG AATTCATTA I lid I AGTAACTTTGCTTGGGACAC 

vegX-9 AATTGAATTCATTATCCTCCTGTGCTCCCTC 

. Baculovirus/insect cell expression of full-length protein- 
vegbad 

AATTGGATCCGGAGTCTCACCATCACCACCATCATGAATCCAACCTGAGTAGTAAATTC 
C 

vegbac2 AATTG AATTCGCTATCCTCCTGTGCTCCCTCTGC 
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>3993180HX LUNGNON03 INCYTE 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGNGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCC 

GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG 

AATTAGGAGTTGTGCAACAGCTCTyTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 

>3510192H1 CONCNOT01 INCYTE 

TGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTT 

TCATCTTCAGGATTTACAGTGCATTCTGAJWVGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAG 

GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCA^ 

TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 

>2559870H1 ADRETUT01 INCYTE 

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 

TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

AGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATC 

TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 

>3979767H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 
GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCA 

ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGA^ 

GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 

>3980011H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAT 

CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTCTTGAG 

AAAGAAAATT AAATGTTGTATTAAAT AG ATCAC C A 

>4825396H1 " BLADDIT01 INCYTE 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATT 
GCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 
AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 

AGGATAGCCGCATCACCACCA 

>3073703H1 BONEUNT01 INCYTE 

AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGT 

GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 

GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCC 

TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

>1302516H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTC 

AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 

CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAATTGCAATGAATGTCAATGTGTCCCA^ 

ACTAAAAAATACCACGAGGTCC 

>3684109H1 HEAANOT01 INCYTE 

ATTTCATCTTCAGGATTTACAGTGCATTCTGAAANAGGAGAAATCAAACANAATTAGGAGTTGTGCAACAGCTCTTTTGA 

GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAAATGTTGTATTAAATAGATCACCAGCTA 

GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 

TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

>4713188H1 BRAIHCTO 1 INCYTE 

CAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCG 
ACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 
CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGT 

TTGCT 

>458823HI KERANOTOl INCYTE 

ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTA^TCTCAGTT 
GTTTCNTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATO 

CAACAGCTCTTTTGAGAGGAGGCCTAAAGGNCAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTA 
- AT AGATC 

>1303909H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGAT1TGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTA 

AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 

CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAG 
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>2739211H1 OVARNOT09 INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGA 
GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATC 
TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAA 
GTGAGCACCTGAT 

>3325591H1 PTHYNOT0 3 INCYTE 

TGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCA\TCGTGGAAAGAAAATTAAATGTTGTATT 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTCNNGGGCNAAAANCGAAAAAT 

>3733565H1 SMCCNOS01 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 

AATTAGGNGTTGTGCAAAAGCT'CTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTNCAATCGTGGAAAGNAAAT^ 

AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>3554223H1 SYNONOT01 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGAT 

ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCTTAACTCTAAAG 

CTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTNTTTTTTTTTGCGCATATTCAC 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

>4507477H1 OVARTDT01 INCYTE 

GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAAT 
GTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCC 
TAAAATCGTATAAAATCTGGA 

>416337 8H1 BRSTNOT3 2 INCYTE 

AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATACG 
GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 
ATGTCCTGGGCCTAAAATCGTATA 
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>2054675H1 BEPINOT01 INCYTE 

AAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATT 

TCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTT 

TATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAA 

TCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTA 

>3 993180H1 LUNGNON03 INCYTE 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGNGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCC 

GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG 

AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 

>3510192H1 CONCNOT01 INCYTE 

TGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTT 

TCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAG 

GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTT 

TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 

>4164633H1 BRSTNOT3 2 INCYTE 

CTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTANTTATGACAACTTAGATCAACTATTTTTAGCTTG 

GTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATG 

TATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCA 

AAGACTTTTTGANAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGGGGAGAAAAT 

>2559870H1 ADRETUT01 INCYTE 

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCA.CCGACGTGGCCCTGGAGCACCA 
TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 
AGTGCAGTGGCTGATTC TATTAGAGAACGTATGC GTTATCTCC ATC CTTAATCTC AGTTGTTTGCTTC AAGGAC CTTTC A 
TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 
>3817470H1 BONSTUT01 INCYTE 

TTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAA 
AATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTC 
ACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTC 
T AATCTGTT AAATAT AT C T ATTTTT AC C AAAGGTATTTAAT ATTC TTT 
>3979767H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAG 

ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTG 

GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 

>3980011H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACATGCATTCTGAAAGAGGAGA 

CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGG 

AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 

>482539 6H1 BLADDIT01 INCYTE 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATT 
GCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 
AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 
AGGATAGCCGCATCACCACCA 

>3073703H1 BONEUNT01 INCYTE 

AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGT 

GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 

GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAG 

TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

>862169H1 BRAITUT03 INCYTE 

AGATGATATAAAATATTGTTGCTCTGACAAAAATAC ATGTATTTCATTCTC GTATGGTGC TAGAGTTAGATT AATCTGC A 
TTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTA 

CTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTT 
GGGGAAATCTGAGCCTAGC 

>4201385H1 BRAITUT29 INCYTE 

TTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATAT 

TAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATCT 

TCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTGACATATAACTGTTGGCTTTT 
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CTAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATAT 
>1302516H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTATAC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAJ^GGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTT 
ACTAAAAAATACCACGAGGTCC 

>3684109H1 HEAANOT01 INCYTE 

ATTTCATCTTCAGGATTTACAGTGCATTCTGAAANAGGAGAAATCAAACANAATTAGGAGTTGTGCAACAGCTCTTTTGA 

GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAAATGTTGTATTAAATAGATCACCAGCTA 

GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 

TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

>2549720H1 LUNGTUT06 INCYTE 

TTAGCTTGGNAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAA 
AATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGT 
AAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACT 
TATGANAGTAG 

>877279H1 LUNGAST01 INCYTE 

CTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAA 
GATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCAT 
TTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGGCTTTTTGAAAATAATTAAATTATCATATCTTCCATTCC 
TGTTATTGGNGG 

>4713188H1 BRAIHCT01 INCYTE 

CAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCG 
ACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 
CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGT 
TTGCT 

>2171082H1 ENDCNOT03 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTA 
TATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATT 
CTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAA 
GATGA 

>875860H1 LUNG AST 01 INCYTE 

CTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACC 
TGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCT 
TTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTAT 
GAC 

>706168H1 SYNORAT04 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGANCTATGTTGCTATGAAT 
TAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAAC 
TACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCANTTTATCGATAAGTCAGTTTATTTGT 
TTCA 

>458823H1 KERANOT01 INCYTE 

ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTT 
GTTTGNTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTG 
CAACAGCTCTTTTGAGAGGAGGCCTAAAGGNCAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAA 
ATAGATC 

>538436H1 LNODNOT02 INCYTE 

AAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTG 
CATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCAT 
TCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTAT 
>I303909H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTATAC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAG 
>2739211H1 OVARNOT09 INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGA 
GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 
TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAA 
GTGAGCACCTGAT 
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>255034 3H1 LUNGTUT06 INCYTE 

TGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCNAATCTTGTTAAATATATCTATTTTTACCAAAG 
GTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGC 
CAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTA 
>5321148H1 FIBPFEN06 INCYTE 

CACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGNCAAAAATACATGTATTTCA.TTCTCGTA 
TGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAA 
TAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAAATTCAGATCCAC 
CATTACTAAC 

>879495H1 THYRNOT02 INCYTE 

ATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAA 
AGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAG 
AGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCT 
TGAAAAA , 

>3325591H1 PTHYNOT03 INCYTE 

TGC AAC AGCTC TTTTGAGAGGAGGCC TAAAGGAC AGGAGAAAAGGTCTTC AATC GTGG AAAGAAAATTAAATGTTGT ATT 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTCNNGGGCNAAAANCGAAAAAT 

>543890H1 OVARNOT02 INCYTE 

TTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCA 
TTCTCGTATGGTGC TAGAGTTAGATTAATCTGCATTTTAAAAAAC TGAATTGGNATAGAATTGGTAAGTTGC AAAGNC TT 
TTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGGATGGAAAATAAAAAGCAACTTATGGAAAGTAGG 
ACATTCAGATC 

>3733565H1 SMCCNOS01 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 

AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTNCAATCGTGGAAAGNAAATT 

AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>464193 9H1 PRO STMT 03 INCYTE 

GTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCCATGCTGATAGGACAGACTGGAT 
TTTNCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGNAGAGATA^ 
GAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATGTGTACATTTTTATATTCTCCTTTGACAT 
ATAACGTGGCTTT 

>2 007780H1 TESTNOT03 INCYTE 

TTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAAT 
ATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAAC 
AAAGATGATATAAAATATTGTTGCTCTGANAAAAATACATGTAT 
>3085331H1 HEAONOT03 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATTTGCTATGAATT 
AAACTTGTGTCGTGCTGATAGGACAGACTGGNTTTTTCATATTTCTTATTANAATTTCTGCCATTAGAAGAAGAGAACTA 
CATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATTTCACTTTATCGATAAGTCAGT 
>3414043H1 PTHYNOT04 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAAT 
TAAACTT GTGTC GTGCTG AT AGGAC AGACTGGATTTTTCATATTTCTTATTAAAATTTC TGCC ATTTAGAAG AAGAGAAC 
TACATTCATGGTTTGGAAGAGATAAACCTGAAA 
>3705963H1 PENCNOT07 INCYTE 

ANACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAA 

TCTGGArmnnimnnnimnnnnnnnGCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTA 

AAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAAT 

TTCTGCCATTAGAAGAAGAGAACTACNTTCANGGTTTGGAAGAGATAACCCTGAAAAGANGGG 

>5137051H1 OVARDIT04 INCYTE 

AAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTNTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGT 

TATTGGAGATGAANATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGG 

AAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTNTGTCA 

GTAGGAACACATCCTATTTATTGTGATGNTGTGGTTTATTAT 

>3 554223H1 SYNONOT01 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGAT 
ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCTTAACTCTAAAG 
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CTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTNTTTTTTTTTGCGCATATTCACATATGTAAACCAGN 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

>4507477H1 OVARTDTO 1 INCYTE 

GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAAT 
GTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCC 
TAAAATCGTATAAAATCTGGA 

>1955646H1 CONNNOT01 INCYTE 

TGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGC 

AACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAA 

ACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGGAACACATCCTATTTA 

TTGTGATGTTGTGGTTTATATCCTAAACC 

>4163378H1 BRSTNOT32 INCYTE 

AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATACG 
GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 
ATGTCCTGGGCCTAAAATCGTATA 

>5095141H1 EPIMNON05 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTTATNTTCACTTTATCGATAAGTCAGNTTATTTGTTTCATTGTGTACATTTNNA 
TATTCTCCTTTTGACATTATAACTGNTGGCTTTTCTAANGNTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATT 
CTTT 

>943826H1 ADRENOT 03 INCYTE 

TATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAA 

AATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATG 

>3451273H1 UTRSNONO 3 INCYTE 

TTTTTTNTTTTGCTCATATTCACATATGTAAACCNGAACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAACTATG 
TTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCANATTTCTTANTAANNTTTCTGCCATTTAG 
AAGA 

>1402278H1 LATRTUT02 INCYTE 

GTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAA 
ATCGTATAAAATCTGGAnrinnnnnnnnnnnnrmn^^ 

CCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTT 
CTTA 

>4361191H1 SKIRNOT01 INCYTE 

GCAAAGACTTTTTGANAATNATTAANTTATCATATCTTCCATTCCTGTTATNGGAGATGANAATAAAAAGCAACTTATGA 
AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCNCAGAAAAACATAAAGC 
ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCNATTTATTGTGNTGTN 
GNGGTTTTATGAT C 

>1307017H1 PLACNOT02 INCYTE 

TGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGC 
CTAAAATCGTATAAAATCTGGAnnrmnnnnimnrmnnnnn^ 

ACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCA 
TAT 

>5032225H1 HEARFET03 INCYTE 

AATTATCATATCTTCCATTCCTGTTATTGGAGATGNAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCAT 
TACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTGTCAGCTTC 
CTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCGTT 
CCAT 

. >3732621H1 SMCCNOS01 INCYTE 

ANAGATGATATAAAANATTGTTGCTCTGACAANNATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTG 
CNTTTTAAAAAACTGANTTGGAATAGANTTGGTAAGTTGCAAAGNCNTTTGAAAATNATTAAGTTATCAGAT 
>3530274H1 BLADNOT09 INCYTE 

TTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATT 

CCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCG 

TGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTAAACTCTGTTCCATACACTTG 

TATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCA 

>3530249H1 BLADNOT0 9 INCYTE 

CTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGANAGTAGACATTCAGATCCAGCCATTACTAACCTAT 
TCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGC 
GTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTC 
TGTATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTTATTGTACCTGG 
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VEGFEl 


AAAATGTATGGATACAACTTAC 


22 


VEGFE2 


GTTTGATGAAAGATTTGGGCTTG 


23 


VEGFE3 


TTTCTAAAGGAAATCAAATTAG 


22 


VEGFE4 


GATAAGATTTGTATCTGATG 


20 


VEGFE5 


GATGTCTCCTCTTTCAG 


17 


VEGFE6 


GCACAACTCCTAATTCTG 


18 


VEGFE7 


AGCACCTGATTCCGTTGC 


19 


VEGFE8 


TAGTACATAGAATGTTCTGG 


20 


VEGFE9 


AAGAGACATACTTCTGTAC 


19 


VEGFEl 0 


CCAGGTACAATAAGTGAACTG 


21 
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+ 3 M N I F L L 

NLLT EEV RLY 

] 



1 AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT TTTCCTTCTG 
AACCTTCTAA CAGAGGAGGT AAGATTATAC 

TCCTTTAGTT TAATCCTATT " CTAAACATAG ACTACTTATA AAAGGAAGAC 
TTGGAAGATT GTCTCCTCCA TTCTAATATG 

+ 3SCTP RNF . SVS IREE LKR 

TDTIFWPGCL 

j 



81 AGCTGCACAC CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG 
AACCGATACC ATTTTCTGGC CAGGTTGTCT 

TCGACGTGTG GAGCATTGAA GAGTCACAGG TATTCCCTTC TTGATTTCf C 
TTGGCTATGG TAAAAGACCG GTCCAACAGA 
-2 



+ 3 LVK RCGG NCA CCL HNCN 
ECQ CVP SKV 



161 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 
ATGAATGTCA ATGTGTCCCA AGCAAAGTTA 

GGAC C AATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT 
TACTTACAGT TACACAGGGT TCGTTTCAAT 

-2 







+3 T 


K 


, K 


Y 


H 


E 


V 


LQLR PKT GVR 


G 


L 


H K 


S 


L 


T 


D 


V 


A 








+ 1 














V S G 


D 


C 


T N 


H 


S 


P 


T 


w 


P 






















] — 



2 41 CTAAAAAATA CCACGAGGTC CTTCAGTTGA G AC C AAAGAC CGGTGTCAGG 
G GATTGC AC A AATCACTCAC CGACGTGGCC 

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC 
CCTAACGTGT TTAGTGAGTG GCTGCACCGG 

-2 

[ 



+3LEHH EEC DCV CRGS TGG 
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> 



+ 2 . V Q R E H R R 

IAAS PPA ALA 



] 



+ 1 WST MRSV TVC AEG AQED 
SRITTSSSC 



321 CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG 
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTCGTGG TACT.CCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC 
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+ 2QS .CA VQW L I L LENV CVI 
S I L NLSC LLQ 



+ 1PELC SAV A D SI RER MRY 

LHP 
> 

401 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 
CTCCATCCTT AATCTCAGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA 
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 

+ 2 GPF IFRI YSA F 
> 

481 AGGACCTTTC ATC TTCAGG A TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG 
TTTGTCTTAA TCCTCAACAC GTTGTCGAGA 

561 TTTGAGAGGA GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG 
AAAATTAAAT GTTGTATTAA ATAGATCACC 

AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAGAAGTT AGCACCTTTC 
TTTTAATTTA CAACATAATT TATCTAGTGG 

641 AGCTAGTTTC AGAGTTACCA TGTAC GTATT CCACTAGCTG GGTTCTGTAT 
TTCAGTTCTT TCGATACGGC TTAGGGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATC GAC CCAAGACATA 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT 
GGCTTAACTC TAAAGCTCCA TGTCCTGGGC 

CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA 
C C GAATTG AG ATTTC GAGGT ACAGGACCCG 

801 CTAAAATCGT ATAAAATCTG GA 
GATTTTAGCA TATTTTAGAC CT 
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+ 3 M N I F L L 

NLLTEEVRLY" 

j 



1 AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT TTTCCTTCTG 
AACCTTCTAA CAGAGGAGGT AAGATTATAC 

TC CTTTAGTT TAATCCTATT C TAAAC AT AG ACTACTTATA AAAGGAAGAC 
TTGGAAGATT GTCTCCTCCA TTCTAATATG 

+ 3SCTP RNF SVS IREE LKR 
TDT IFWP GCL 

j _ 

81 AG CTGC AC AC CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG 
AACCGATACC ATTTTCTGGC CAGGTTGTCT 

TC GACGTGTG GAGCATTGAA GAGTCACAGG TATTCCC TTC TTGATTTCTC 
TTGGCTATGG TAAAAGACCG GTCCAACAGA 
-2 



+ 3 1* V K RCGG NCA CCL HNCN 
ECQ CVP SKV 



161 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 
ATGAATGTCA ATGTGTCCCA AGCAAAGTTA 

GGACCAATTT GCGACACCAC CCTTGACACG G AC AACAGAG GTGTTAACGT 
TACTTACAGT TACACAGGGT TCGTTTCAAT 

-2 







+ 3 T 


K 
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H K 
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2 41 CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG 
GGATTGCACA AATCACTCAC CGACGTGGCC 

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC 
CCTAACGTGT TTAGTGAGTG GCTGCACCGG 
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-2 



+ 3 L E H H EE C DCV CRGS TGG 



> 

+ 2 - VQREHRR 
IAAS PPA ALA 

] . 



+ 1 WST MRSV TVC AEG AQED 
SRI TT3 SSC 



321 CTGGAGCACC ATGAGGAGTG TGAC TGTGTG TGCAGAGGGA GCACAGGAGG 
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTC GTGG TACTCCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC 
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+ 2QSCA VQW LIL LENV CVI 
S I L NL.SC LLQ 



+ 1PELC SAV AD S I RER MRY 

LHP 

> 

401 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 
CTCCATCCTT AATCTCAGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC C G AC TAAG AT AATCTCTTGC ATACGCAATA 
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 



+2 GPF IFRI YSA F 
> 

481 AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG 
TTTGTCTTAA TCCTCAACAC GTTGTC GAGA 

561 TTTGAGAGGA GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG 
AAAATTAAAT GTTGTATTAA AT AGATC AC C 

AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAGAAGTT AGCACCTTTC 
TTTTAATTTA CAACATAATT TATCTAGTGG 



641 AGCTAGTTTC AGAGTTACCA TGTAC GTATT CCACTAGCTG GGTTCTGTAT 
TTCAGTTCTT TCGATACGGC TTAGGGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATC GAC CCAAGACATA 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

7 21 GTCAGTACAG GAAAAAAACT GTGCAAGTGA G C AC C TG ATT CCGTTGCCTT 
GGCTTAACTC TAAAGCTCCA TGTCCTGGGC 
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CAGTCATGTC C TTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA 
CCGAATTGAG ATTTC GAGGT ACAGGACCCG 

801 CTAAAATCGT ATAAAATCTG GATTTTTTTN TTTTTTTTTG CGCATATTCA 
CATATGTAAA C C AG AAC ATT CTATGTAGTA 

GATTTTAGCA TATTTTAGAC CTAAAAAAAN AAAAAAAAAC GCGTATAAGT 
GTATACATTT GGTCTTGTAA GATACATGAT 

881 CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAAC TTGTGT 
CGTGCTGATA GGACAGACTG GATTTTTCAT 

GTTTGGACCA AAAATTTTTC CTTGATACAA CGATACTTAA TTTGAACACA 
GCACGACTAT CCTGTCTGAC CTAAAAAGTA 

-3 < 



9 61 ATTTCTTATT AAAATTTCTG CCATTTAGAA GAAGAGAACT ACATTCATGG 
TTTGGAAGAG ATAAACCTGA AAAGAAGAGT 

TAAAGAATAA TTTTAAAGAC GGTAAATCTT CTTCTCTTGA TGTAAGTACC 
AAACCTTCTC TATTTGGACT TTTCTTCTCA 

-3 



1041 GGCCTTATCT TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTGTA 
CATTTTTATA TTCTCCTTTT GACATTATAA 

CC GGAATAGA AGTGAAATAG CTATTCAGTC AAATAAACAA AGTAACACAT 
GTAAAAATAT AAGAGGAAAA CTGTAATATT 
-3 [ 

1121 CTGTTGGCTT TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT 
TTAATATTCT TTTTTATGAC AACTTAGATC 

G AC AAC CG AA AAGATTAGAA CAATTTATAT AGATAAAAAT G GTTTC CAT A " 
AATTATAAGA AAAAATACTG TTGAATCTAG 

12 01 AACTATTTTT AGCTTGGTAA ATTTTTCTAA ACACAATTGT TATAGC CAG A 
GGAACAAAGA TGATATAAAA TATTGTTGCT 

TTGATAAAAA TCGAACCATT TAAAAAGATT TGTGTTAACA ATATCGGTCT 
CCTTGTTTCT ACTATATTTT ATAACAACGA 

1281 CTGACAAAAA TACATGTATT TCATTCTCGT ATG GTGC TAG AGTTAGATTA 
ATCTGCATTT TAAAAAACTG AATTGGAATA 

GACTGTTTTT ATGTACATAA AGTAAGAGCA TACCACGATC TCAATCTAAT 
TAGAC GTAAA ATTTTTTGAC TT AA C C TT AT 

13 61 GAATTG GTAA GTTGCAAAGA CTTTTTGAAA ATAATTAAAT TATCATATCT 
TCCATTCCTG TTATTG GAGA TGAAAATAAA 

CTTAAC C ATT CAACGTTTCT GAAAAACTTT TATTAATTTA ATAGTATAGA 
AGGTAAGGAC AATAACCTCT ACTTTTATTT 

1441 AAGCAACTTA TGAAAGTAGA CATTCAGATC CAGCCATTAC TAACCTATTC 
CTTTTTTGGG GAAATC TGAG CCTAGCTCAG 

TTCGTTGAAT ACTTTCATCT GTAAGTCTAG GTC GGTAATG ATTGGATAAG 
GAAAAAACCC CTTTAGACTC GGATCGAGTC 
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1521 AAAAACATAA AGCAGCTTGA AAAAGACTTG GCAGCTTCCT G ATAAAGC GT 
GCTGTGCTGT GCAGTAGGAA CACATCCTAT 

TTTTTGTATT TCGTGGAACT TTTTC TGAAC CGTCGAAGGA CTATTTCGCA 
C G AC AC G AC A CGTCATCCTT GTGTAGGATA 

1601 TTATTGTGAT GTTGTGGTTT TATTATCTTA AACTCTGTTC CATACACTTG 
TATAAATACA TGGATATTTT TATGTACAGA 

AATAACACTA CAACACCAAA ATAATAGAAT TTGAGACAAG GTATGTGAAC 
ATATTTATGT ACCTATAAAA ATACATGTCT 

1681 AGTATGTCTC TTAACCAGTT CACTTATTGT ACCTGG 
TCATACAGAG AATTGGTCAA GTGAATAACA TGGACC 
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£ DNA and polypeptide sequence used for mammalian cell expression 

+ 1 rnslf gll llts alagqr 

1 GGATCCAAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC CGGCCAGAGA 

+ lqgtq aES NLS SKF Q F S S NKE 
61 CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGT77TCCAG CAACAAGGAA 

+ 1GNGV QDP QHE RIIT VST NGS 
121 CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTGTGTCTAC TAATGGAAGT 

+ 1IHSP RFP HTY PRNT V L V WRL 
181 ATXCACAGCC CAAGGTTTCC T C AT ACTT AT CCAAGAAATA CGGTCTTGGT ATGGAGATTA 

+ 1VAVE ENV WIQ LTFD ERF GLE 
241 GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT TGGGCTTGAA 

+ 1DPED DIC KYD FVEV EEP SDG 
301 GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC CAGTGATGGA 

+ 1TILG RWC GSG T VPG KQI SKG 
361 ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT TTCTAAAGGA 

+ 1NQIR IRF VSD E Y F P SEP GFC 
4 21 AATCAAATTA GGATAAGATT TGTATCTGAT GAATATT7TC CTTCTGAACC AGGGTTCTGC 

+1I HYN IVM pqf TEAV SPS VLP 
481 ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGA3TCCTTC AGTG CTACCC 

+ 1PSAL ?LD L Li N NAIT AFS TLE 
541 CCTTCAGCTT TGCCACTGGA CCTG CTTAAT AATGCTATAA CTGCCTTTAG TACCTTGGAA 

+ 1DLIR YLE PER WQLD LED LYR 
601 GACCTTATTC GAT AT CTTG A ACCAGAGAGA TGGCAGTTGG ACT7AGAAGA TCTATATAGG 

+ 1PTWQ.LLG KAF VrGR K*SR VVD 
661 CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG AGTGGTGGAT 

+ 1LNLL TEE VRL YSCT ?RN FSV 
721 CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA CTTCTCAGTG 

+ 1SIRE ELK RTD TIFW PGC LLV 
781 TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG TCTCCTGGTT 

+ 1KRCG GNC ACC LHNC NEC QCV 
841 AAACGCTGTG GTC-3GAACTG TGCCTGTTGT CTCCACAA7T GCAATGAATG TCAATGTGTC 

+ 1PSKV TKK YHE VLQL ?. PK TGV 
901 CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT TGAGACCAAA GACCGGTGTC 

+ 1RGLH KSL TDV ALZH HEE CDC 
961 AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGASC ACCA7GAGGA GTGTGACTGT 

+ 1VCR G STG G SR G?jE C-KP IPN 
1021 GTGTGCAGAG GGAGCACAGG AGGATCTAGA GGGCCC77CG AAGG7AAGCC TATCCCTAAC 

+ 1 P L L G LPS T R T G K H K H H H 
1081 CCTCTCCTCG GTCTCGATTC TACGCGTACC GGTCATCATC AC CAT CACCA TTGA 
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/^/2P. DNA aad polypeptide sequence used for baculovirus/insect cell expression 

1 GAATTCAAAG GCC7GTATTT TACTGTTTTC GTAACAGTTT TGTAATAAAA AAACCTATAA 

+ 3 mkf Ivn.valv fmv vyi s yi 
61 ATATGAAATT CTTAGTCAAC GTTGCCCTTG TTTTTATGGT CGTATACATT TCTTACATCT 

+ 3 y a D P ES H HHHH H ES NLS SKF 

121 ATGCGGATCC GGAGTCTCAC CATCACCACC ATCATGAATC CAACCTGAGT AGTAAATTCC 

+ 3QF SS NKE QNGV QDP' QHE RII 
181 AGTTTTCCAG CAACAAGGAA CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA 

+ 3TVST WGS IHSP RFP HTY P R N 
241 CTGTGTCTAC TAATGGAAGT ATTCACAGCC CAAGGTTTCC T CAT ACTT AT CCAAGAAATA 

+ 3TVLV WRL VAVE ENV WIQ LTF 

3 01 CGGTCTTGGT AT GGAG ATT A GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG 

♦ 3 D E R F GLE DP ED DIC KYD FVE 
361 ATGAAAGATT TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG 

+3 VEEP SDG TILG RWC GSG TVP 

4 21 TTGAGGAACC CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG 

+ 3GKQI SKG NQIR I R F VSD EY'F 
4 81 GAAAACAGAT TTCTAAAGGA AATCAAATTA GGATAAGATT TGTAT CTGAT GAATATTTTC 

+ 3PSEP GFC IKYN IVM PQF TEA 
541 CTT CTGAACC AGGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG 

+ 3VSPS VLP PSAL PLD L L N N A I 
601 TGAG7CCTTC AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA 

+ 3TAFS TLE DLIR YLE PER WQL 
661 CTGCCTTTAG TACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG 

+ 3DLED LYR PTWQ L L G KAF VFG 
721 ACTTAGAAGA TCTATATAGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA 

+ 3RKSR VVD LNLL TEE V R L YSC 
781 GAAAATCCAG AGTGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA 

+ 3TPRN FSV S IRE ELK RTD TIF 
841 CACCTCGTAA CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT 

+ 3WPGC LLV KRCG GNC ACC LKN 
901 GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT 

+ 3CNEC OCV PS.KV TKK .YHE VLQ 
961 GCAA7GAATG T CAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAG7 

+ 3LRPK TGV RGLH.KSL TDV ALE 
1021 TGAGACCAAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC 

+ 3HHEE SDC VCRG STG G 
1081 ACCATGAGGA GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCTC TAGA 
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s / Cr. Srs. DNA and polypeptide sequence used for E.coli expression 



+ 3 Q T N S S S N V N V - ^ N K N N N L G T 

1 CGCAGACTAA TTCGAGC TCG AACAACAACA ACAATAACAA TAACAACAAC CTCGGGATCG ~ 

+ 3 E G R I S E F E SNL SSK F Q F SS K 
61 AGGGAAGGAT TTCAGAATTC GAATCCAACC TGAGTAGTAA ATTCCAGTTT TCCAGCAACA 

+3KEQN GVQ DPQ H ERI ITV STN 
121 AGGAACAGAA CGGAGTACAA GATCCTCAGC ATGAGAGAAT TATTACTGTG TCTACTAATG 

+3GSIH SPR FPHT YPR NTV LVW 
181 GAAGTATTCA CAGCCCAAGG TTTCCTCATA CTTATCCAAG AAATACGGTC TTG GTATGGA 

+ 3RLVA VEE NVWI QLT FDE RFG 
241 G ATTAGT AG C AGTAGAGGAA AATGTATGGA TACAACTTAC GTTTGATGAA AGATTTGGGC 

+3LEDP EDD ICKY DFV EVE EPS 
301 TTGAAGACCC AGAAGATGAC ATATGCAAGT ATGATTTTGT AGAAGTTGAG GAACCCAGTG 

+3DGTI LGR WCGS GTV PGK QIS 
361 ATGGAACTAT ATTAGGGCGC TGGTGTGGTT CTGGTACTGT ACCAGGAAAA CAGATTTCTA 

+ 3 K G N* Q IRI RFVS DEY FPS EPG 
421 AAGGAAATCA AATTAGGATA AGATTTG TAT CTGATGAATA TTTTCCTTCT GAACCAGGGT 

+ 3FCIH YNI VMPQ FTE A V S PSV 
481 TCTGCATCCA CTACAACATT GTCATGCCAC AATTCACAGA AGCTGTGAGT CCTTCAGTGC 

+3LPPS ALP LDLL NNA ITA FST 
541 TACCCCCTTC AGCTTTGCCA CTGGACCTGC TTAATAATGC TATAACTGCC TTTAGTACCT 

+ 3LEDL IRY LEPE RWQ LDL EDL 
601 TGGAAGACCT TATTCGATAT CTTGAACCAG AGAGATGGCA GTTGGACTTA GAAGATCTAT 

+ 3Y RPT WQL LGKA FVF GRK S R V ' 
661 ATAGGCCAAC TTGGCAACTT CTTGGCAAGG CTTTTGTTTT TGGAAGAAAA TCCAGAGTGG 

+ 3 V D L N L L T E E V R L Y S C T P R N F 
721 TGGATCTGAA CCTTCTAACA GAGGAGGTAA GATTATACAG CTGCACACCT CGTAACTTCT 

+ 3SVSI REE LKRT DTI FWP G CI* 
781 CAGTGTCCAT AAGGGAAGAA CTAAAGAGAA CCGATACCAT TTTCTGGCCA GGTTGTCTCC 

+ 3 L V X R CGG NCAC CLH NCN ECQ 
841 TGGTTAAACG CTGTGGTGGG AACTGTGCCT GTTGTCTCCA CAATTGCAAT GAATGTCAAT 

+ 3CV?S KVT KKYH EVL QLR PKT 
901 GTGTCCCAAG CAAAGTTACT AAAAAATACC ACGAGGT CCT TCAGTTGAGA CCAAAGACCG 

+ 3GVRG LHK SLTD VAL EHH EEC 
961 GTGTCAGGGG ATTGCACAAA TCACTCACCG ACGTGGCCCT GGAGCAC CAT GAGGAGTGTG 

+ 3 D C V C R G S T G G H H H H K H * 
1021 ACTGTGTGTG CA3AGGGAGC ACAGGAGGAC ATCATCACCA TCACCATTGA TCTAGAGTCG 

1081 ACCTG CAGGC AAGCTT 
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Disuiphidoiinked dimerisation of VEGF-X 

(A) Mammalian cell expression 
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Glycosylation of VEGF-X 
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DNA and polypeptide sequence used for E.coli expression of the PDGF-like domain 

+ 3 MR G S H H H H H S_£l ASM 

1 AAGGAGATAT ACATATGCGG GGTTCTCATC AT CAT CAT CA TCATGGTATG GCTAGCATGA 

+ 3 TGGO OMG RDLY DDD DKD P G R 
61 CTGGTGGACA GCAAATGGGT CGGGATCTGT ACGACGATGA CGATAAGGAT CCGGGAAGAA 

+3KSRV VDL NLLT EEV RLY SCT 
121 AATCCAGAGT GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC 

+3PRNF SVS IREE LKR TDT IFW 
181 CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 

+ 3PGCL LVK RCGG N C A CCL KNC 
241 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 

+3NECQ CVP SKV T KKY HEV LQL 
301 ATGAATGTCA ATGTGTC CCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA 

+ 3RPKT GVR G L H K S LT D V A LEH 
3 61 GACCAAAGAC CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC 

+3HEEC DCV CRGS TGG 
421 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAATGAATT CGAAGCTTGA 

481 TCCGGCTGCT AACAAAGCCC 
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Expression of PDGF domain in Rcoli 



1 2 3 
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DNA and polypeptide sequence used for E.coli expression of the CUB-like domain 



+ 2 MA MDTO INS DPE 5HHH HHH 

1 GGCGATGGCC A7GGATATCG GAATTAATTC GGATCCGGAG TCTCACCATC ACCACCATCA 

+ 2 ESN LSSK FQF SSN KEQN GVQ 
61 TGAATCCAAC CTGAGTAGTA AATTCCAGTT TTCCAGCAAC AAGGAACAGA ACGGAGTACA 

+ 2 DPQ HERI ITV STN GSIH S P R 
121 AGATCCTCAG CATGAGAGAA TTATTACTGT G T CT ACT AAT GGAAGTATTC A CAGC CCAAG 

+ 2 FPH TYPR NTV L VW R I* V A VEE 
181 GTTTCCTCAT ACTTATCCAA GAAATACGGT CTTGGTATGG AGATTAGTAG CAGTAGAGGA 

+ 2 NVW IQLT FDE R FG LEDP EDD 
241 AAATGTATGG ATACAACTTA CGTTTGATGA AAGATTTGGG CTTGAAGACC CAGAAGATGA 

+ 2 ICK YDFV EVE EPS DGTI LGR 
301 CATATGCAAG TATGATTTTG TAGAAGTTGA GGAACCCAGT GATGGAACTA TATTAGGGCG 

+ 2 WCG SGTV PGK QIS KGNQ IRI 
361 CTGGTGTGGT TCTGGTACTG TACCAGGAAA ACAGATTTCT AAAGGAAATC AAATTAGGAT 

+ 2 *R F V SDEY FPS EPG FCIH Y N I 
421 AAGATTTGTA TCTGATGAAT ATTTTCCTTC TGAACCAGGG TTCTGCATCC ACTACAACAT 

+ 2 VMP C FTE AV 
431 TGTCATGCCA CAATTCACAG AAGCTGTGTA GTCGAGCTCC GTCGACAAGC TTGCGGCCGC 

541 ACTCGAGCAC 
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Expression of the CUB domain in E.coli 



H CUB 




SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 — PCT/US99/30S03 

42 / 45 



4 

! 

| 



fa/re - d/ay-/-s-ecz/ ! -#?e/7j*) 





SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 



PCT/US99/30503 



43 / 54 



A6&?0S7 6fo£sy/ea/ l^/s? / r /70 / <?//?e//&/ <Te/te /r^ttr- 



























1 




1 




I 








% 


0 



i sr 



1 



SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 — PCT/US99/30503 

44/54 



A **** 

5 ' 




V- 



\\\\\ 



v \ \ ^ \ \ 



\ 



SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 



45/54 



PCT7US99/30503 



u u 



o to 

CO CO 





SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 



46 / 54 



PCT/US99/30503 




SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 — PCT/US99/30503 

47 / 54 



Partial intron/eion structure of the VEGF-X gene 
(A) - Genomic DNA sequences of 2 exons determined by sequencing 



tttcttttataccatatagtgc-cgatctgaaccagGGTTCTG» 

AGTCCTTCAGTGCTACCCCCTTC^GCTTTGCCACTGGACCTGCTTAATAATGCTATAACT^ 
TCGATATCTTGAACCAGAGAGATC^CAGTTGGACT^ 

TTGGAAGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCAC^ 
TCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 
TTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGg C agg tatacaatttcctcttt 
ggtetccttcgggtatttcatgrcce 

aaagccagtcatagacactcgttgacttccaaaagtggcttactcttattccctcccagGTCCTTCAGTTGAGACCAAAGACCGGT 
GTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTG^ 

ATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCG 
CCTTAATCTCAGTTGTTTGCTTCAACX3ACCTTTCAT 

GAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGT 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCT 

GGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCT 

CTAAAATCGTATAAAATCTGG A TT I i riT TT TT T nT r r GCGCAT^^ 

TGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATA 

AATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAA 

TCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTm 

AATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATTTT^ 

ACACAATTGTTATAGCCAGAGGAJ\CAAAGATGATATAAAATATTGTTGCTCTGACAA 

CTAGAGTTAGATTAATCTGCATTTTAA 

TCATATCTTCCATTCCTGTTATT^ 

TCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAA^ 
TGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTC<3TTTTATTATOT 

TGG ATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTT CACTTATTGTACTCTGG CAATTTAAAAGAAAAT CAGTAAAATATTT 

TOCTTGTAAAATGCTTAATATCGTGCCTAGGTTATGTGGTGACTATTTGAATCAAAAATGTATTGAA 

GGCTATTTTGGGGAGAAAATTargcgcgcgcgtgctcaagatccacttctcggaccccgagaaaacgaaagataaa 
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(B) - Location of splice sites within the cDNA sequence 

2 GAATTCGCCC TTTTGTTTAA ACCTTGGGAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC 

61 TTTTCAAAAA CTGGAGA CA C AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGGATTATGT 

121 GGAAACTACC CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG 

181 CCTTCCCCTG GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG 

♦3 MSLFGLLLLTS 
241 AGTGAGCTCT CACCCCAGTC AG C CAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG 

+3 A L A G Q R Q G T 0 A ESN L S S K F Q 
301 CCCTGGCCGG CCAGAGACAG GGGA CTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT 

+3 F S S N X E 0 N G V Q D P Q HER I I T 
361 TTTCCAGCAA CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG 

+3 V S T N G S.I H S P R FPU T Y P R N T 
421 TGTCTACTAA TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG 

+3 V L V W R L V A V E E N V W I Q Is T F D 
481 TCTTGGTATG GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG 

+3 E R F G LED P E D D X C K Y D F V E V 
S41 AAAGATTTGG GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG 

+3EEPS DGT ILGR W C G S G T V P G 
6Q1 AGGAACCCAG TGATGGAACT ATATTAGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA 

+3 K Q 1 S K G N 0 I R I R F V S D E Y F t P 
€61 AACAGATTTC 7AAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT 

+3 S E P |g F C I KYNI VMP .QFT EAV 
721 CTGAACCAGG GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA 

+3SPSV L ? P SALP LDI# L, N N AIT 
781 GTCCTTCAGT GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG 

+ 3AFST LSD LIRY L» E P E R W QLD 
841 CCTTTAGTAC CTTGGAAGAC CTTATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT 

+ 3LEDL Y R P TWQL IiGK AFV FGR 
901 TAGAAGATCT AT AT AGO CCA ACTTGGCAAC TTCTTGGCAA GGCTTTTGTT TTTGGAAGAA 

+ 3KSRV VDL N'LLT E A/V R L Y S C T 
961 AATCCAGAGT C-GTGGATCTG AACCTTCTAA CAGAGGA&7 AAGATTATAC AGCTGCACAC 

+ 3PRNF 5 V S IREE.LKR TDT IFW 
1021 CTCGTAACTT C7CAGTSTCC ATAAGGGAAG AACTAAAC-AG AACCGATACC ATTTTCTGGC 

+ 3PGCL LVK RCGG NCA CCL HNC 
10 81 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC C7GTTGTCTC CACAATTGCA 



+ 3NECQ CVP SKVT K K Y KE 
1141 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA C7AAAAAATA CCACGAC 



V L 0 L 
JTC CTTCAGTTGA 
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+ 3RPKT GVR GLHK SLT D V A LEH 
1201 GACCAAAGAC CGGTGTCAGG. GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC 

+3HEEC DCV CRGS TGG 
1261 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC 

1321 AGCTCTTGCC CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 

1381 CTCCATCCTT AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC 

1441 ATTCTGAAAG AGGAGACATC AAACAGAATT AGGAGTTGTG CAACAGCTCT TTTGAGAGGA 

1501 GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG AAAATTAAAT GTTGTATTAA 

1561 ATAGATCACC AGC7AGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT 

1621 TTCAGTTCTT TCGATACGGC TTAGGGTAAT GTCAGTACAG GAAAAAAACT GTGCAAGTGA 

1681 GCACCTGATT CCGTTGCCTT GCTTAACTCT AAAGCTCCAT GTCCTGGGCC TAAAATCGTA 

1741 TAAAATCTGG ATTTTTTTTT TTTTTTTTTG CTCATATTCA CATATGTAAA CCAGAACATT 

1801 CTATGTACTA CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTGT 

1361 CATGCTGATA GGAtAGACTG GATTTTTCAT ATTTCTTATT AAAATTTCTG CCATTTAGAA 

1921 GAAGAGAACT ACATTCAXGG TTTGGAAGAG ATAAACCTGA AAAGAAGAGT GGCCTTATCT 

1981 TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTGTA CATTTTTATA TTCTCCTTTT 

2041 GACATTATAA CTGTTGGCTT TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT 

2101 TTAATATTCT TTTTTATGAC AACTTAGATC AACTATTTTT AGCTTGGTAA A7TTTTCTAA 

2161 ACACAATTGT TATAGCCAGA GGAACAAAGA TGATATAAAA TATTGTTGCT CTGACAAAAA 

2221 TACATGTATT TCATTCTCGT ATGGTGCTAG AGTTAGATTA ATCTGCATTT TAAAAAACTG 

2281 AATTGGAATA GAATTGGTAA GTTGGAAAGA CTTTTTGAAA ATAATTAAAT TATCATATCT 

23 41 TCCAT7CCTG TTATTGGAGA TGAAAATAAA AAG CAACTT A TGAAAGTAGA CATTCAGATC 
2401 CAGCCATTAC TAACCTATTC CTTTTTTGGG GAAATCTGAG CCTAGCTCAG AAAAACATAA 

24 61 AGCACCTTGA AAAAGACTTG GCAGCTTCCT GATAAAGCGT GCTGTGCTGT GCAGTAGGAA 
2521 CACATCCTAT TTATTGTGAT GTTGTGGTTT TATTATCTTA AACTCTGTTC CATACACTTG 

25 31 TAT AAA 7 AC A TGGATATT7T TATGTACAGA AGTATGTCTC TTAACCAGTT CACTTATTGT 
2641 ACCTGGAAGG GCGAATTCTG CAGATATC 
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A n nl tc; mi's tir;i jail's 

lifcrclcrcncc * B0192/7011WO 



InlcrnalioiKil application No. 
PCT/US99/30503 



INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCTRule !3/>/.v) 



A. The indications made below relate to the deposited microorganism or other biological material referred to in the description 
21 .line 15-16 



on page 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet | | 



Name of depositary institution 

BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS (BCCM)™ 
LABORATORIUM VOOR MOLECULAIRE BIOLOGIE - PLASMIDENCOLLECTIE (LMBP) 



Address of depositary institution (including postal code and country) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 



Date of deposit 

20 December 1999 (20.12.99) 



Accession Number 
LMBP 3991 



C- ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States} 



E. SEPARATE FURNISHING OF INDICATIONS (have blank if not applicable) 



fhc indications listed below will be submitted to the International Bureau later (s^cify- the general nai are of rite indiccaiom e.g.. "Accession 
Ntunber of De/Htsit") 



For receiving Office use only 



1 1 This sheet was received with the international application 



Authorized officer 



\yC\ This sheet was received bv the J ntc^at icirml^ureau on: 

.19 APRIL 2000 < 19^11 


Authorized officer £U en MoySS \ 
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LMBP-COLLECTION 

Page 1 of Form BCCM™/LMBP/BP/4/99-23 Receipt in the case of an original deposit 



Budapest Treaty on the International Recognition of the Deposit of Microorganisms for 

the Purposes of Patent Procedure 

Receipt in the case of an original deposit issued pursuant to Rule 7.1 by the 
International Depositary Authority BCCM™/LMBP identified at the bottom of next page 

international Form BCCM™/LMBP/BP/4/99-23 



To : Name of the depositor : Janssen Pharmaceutica N.V. 



Address : Turnhoutseweg 30 

B-2340 Beerse 
Belgium 



I. Identification of the microorganism: 

1.1 Identification reference given by the depositor: 

VEGF-X CUB PET22b 



1.2 Accession number given by the International Depositary Authority: 



LMBP 3991 
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BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS - BCCM™ 
LMBP-COLLECTION 

Page 2 of Form BCCM™/LMBP/BP/4/99-23 Receipt in the case of an original deposit 

II. Scientific description and/or proposed taxonomic designation 
The microorganism identified under I above was accompanied by: 

v (mark with a cross the applicable box(es)) 

- a scientific description yes no |"1 

- a proposed taxonomic designation yes □ no [3 

III. Receipt and acceptance 

This International Depositary Authority accepts the microorganism identified under I 
above, which was received by it on (date of original deposit) : December 20, 1999 



IV. International Depositary Authority 



Belgian Coordinated Collections of Microorganisms (BCCM™) 

Laboratorium voor Moleculaire Biologie - Plasmtdencoliectie (LMBP) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 



Signature(s) of person(s) having the power to represent the International Depositary 
Authority or of authorized official(s): 




Date : January 12, 2000 



Marline Vanhoucke 
BCCM/LMBP curator 
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LMBP-COLLECTION 

Page 1 of Form BCCM™/LMBP/BP/9/99-23 Viability statement 

Budapest Treaty on the International Recognition of the Deposit of Microorganisms for 

the Purposes of Patent Procedure . 

Viability statement issued pursuant to Rule 10.2 by the International Depositary 
Authority BCCM™/LMBP identified on the following page 

International Form BCCM™/LMBP/BP/9/99-23 



To : Party to whom the viability statement is issued: 
Name : Dr Filip De Corte 

Address : Janssen Pharmaceutica N.V. 

Turnhoutseweg 30 
B-2340 Beerse 
Belgium 

I. Depositor 

1.1 Name : Janssen Pharmaceutica N.V. 



I.2 Address : Turnhoutseweg 30 

B-2340 Beerse 
Belgium 



IL Identification of the microorganism: 

11.1 Accession number given by the International Depositary Authority: 



LMBP 3991 



II. 2 Date of the original deposit (or where a new deposit or a transfer has been 
made, the most recent relevant date) : December 20, 1999 

III. Viability statement. 

The viability of the microorganism identified under II above was tested on 

: January 1 1 , 2000 

(Give date. In the cases referred to in Rule 10.2(a)(ii) and (iii), refer to the most recent 
viability test). 

On that date, the said microorganism was: (mark the applicable box with a cross) 

[X] viable 

I I no longer viable 



PCT/US99/30503 



BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS - BCCM™ 
LMBP-COLLECTION 

Page 2 of Form BCCM™/LMBP/BP/9/9?-23 Viability statement 



IV. Conditions under which tne viability test has been performed: 

(Fill in if the information has been requested and if the results of the test were 
negative). 



V. International Depositary Authority 



Belgian Coordinated Collections of Microorganisms (BCCM 1 * 1 ) 

Laboratorium voor Moleculaire Biologie - Plasmidencollectie (LMBP) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 



Signature(s) of person(s) having the power to represent the Internationa! Depositary 
Authority or of authorized official(s); 




Date : January 1 2, 2000 



Martine Vanhoucke 
BCCM/LMBP curator 



